For DocEng26, the NOMOCRAT project investigators organised a competition on building an OCR system that extracts Maltese text from an image in paragraph form (rather than as separate lines of text).
To participate:
1. […] --extra-index-url line if you are not using GPUs).
2. Run competition_evaluator.py to test that everything works.
3. […] competition_transcriber.py and running competition_evaluator.py (see the comments in the example scripts for instructions).
4. Replace competition_transcriber.py with your own solution to the competition task (according to the rules linked below).
5. […] competition_transcriber.py (that downloads the model from Hugging Face) and, if needed, a requirements.txt file listing any additional packages to install beyond those already mentioned in the assets.
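
To illustrate what "paragraph form" could mean in practice, here is a minimal sketch that merges line-level OCR output into paragraphs. It is not taken from the competition assets: the function name `lines_to_paragraphs`, the convention that a blank line marks a paragraph break, and the sample text are all assumptions made for illustration, and the interface expected by competition_transcriber.py may differ.

```python
def lines_to_paragraphs(lines):
    """Merge line-level OCR output into a list of paragraph strings.

    Assumption (not from the competition rules): an empty or
    whitespace-only line marks a paragraph break.
    """
    paragraphs = []
    current = []  # words collected for the paragraph being built
    for raw in lines:
        words = raw.split()
        if not words:
            # Blank line: close the paragraph collected so far, if any.
            if current:
                paragraphs.append(" ".join(current))
                current = []
            continue
        current.extend(words)
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs


if __name__ == "__main__":
    # Hypothetical line-level output from an OCR engine.
    ocr_lines = [
        "Il-Malti huwa l-ilsien nazzjonali",
        "tar-Repubblika ta' Malta.",
        "",
        "It-tieni paragrafu.",
    ]
    for paragraph in lines_to_paragraphs(ocr_lines):
        print(paragraph)
```

The sketch deliberately leaves hyphens at line ends untouched: Maltese writes the definite article with a hyphen (as in "Il-Malti"), so a trailing hyphen does not necessarily mean a word was split across lines.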