Course 1: Text Recognition
Andreas Fischer · HES-SO / University of Fribourg
Lecturer

Andreas Fischer is a Full Professor at the University of Applied Sciences and Arts Western Switzerland (HES-SO Fribourg) and a Lecturer at the University of Fribourg, where he leads the PARAM research group within the iCoSys Institute. His research focuses on pattern recognition, machine learning, and document image analysis, with a particular emphasis on handwritten text recognition for historical documents. Fischer has made significant contributions to automatic reading of medieval manuscripts and early modern scripts, advancing methods that make digitized cultural heritage computationally accessible. He has been involved in major projects developing AI-supported tools for large volumes of archival and library materials.
🤖 Bio generated by AI from public academic profile.
Lecture Overview
Overview
Fischer presents text recognition as a key access technology for written cultural heritage. His lecture moves from the motivation for automatic reading of historical documents to the sequence of tasks involved: locating text on a page, recognizing it line by line, and making it searchable afterwards.
Main Points
- Text recognition matters because digitized images alone are not enough; researchers also need machine-readable text for search, navigation, and large-scale access.
- The workflow begins with layout analysis, continues with line-level recognition, and often ends with retrieval tasks such as keyword search.
- Fischer distinguishes between full transcription, keyword spotting, and transcription alignment, and explains that the three tasks differ in difficulty and in the amount of training data they require.
- Modern print OCR is close to solved, while historical handwriting remains much harder because scripts, layouts, and languages vary widely across sources.
- The lecture frames machine learning as the practical basis for current systems, but stresses that success depends on training material, document structure, and the target use case.
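The three-stage workflow described above (layout analysis, line-level recognition, keyword search) can be illustrated with a toy sketch. It is not Fischer's system: the names `TextLine`, `segment_lines`, `recognize`, `build_index`, and `search` are hypothetical, and both the layout and recognition stages are stand-ins that operate on plain text rather than on page images. The sketch only shows how the stages hand data to one another.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class TextLine:
    page_id: str
    line_no: int
    content: str  # stand-in for a cropped line image


def segment_lines(page_id, page_text):
    """Placeholder for layout analysis: split a page into non-empty lines."""
    non_empty = [raw for raw in page_text.splitlines() if raw.strip()]
    return [TextLine(page_id, i, raw) for i, raw in enumerate(non_empty, start=1)]


def recognize(line):
    """Placeholder for line recognition.

    A real system would run a trained model on the line image;
    here the "image" is already text, so it is returned as-is.
    """
    return line.content.strip()


def build_index(lines):
    """Map each lowercased word to the (page, line) positions where it occurs."""
    index = defaultdict(list)
    for line in lines:
        for word in recognize(line).lower().split():
            index[word].append((line.page_id, line.line_no))
    return index


def search(index, keyword):
    """Exact-match keyword search over the recognized text."""
    return index.get(keyword.lower(), [])


# Hypothetical two-page example.
pages = {"p1": "In nomine domini\n\namen", "p2": "Domini nostri"}
lines = [line for pid, text in pages.items() for line in segment_lines(pid, text)]
index = build_index(lines)
hits = search(index, "Domini")
```

Exact matching over a full transcription is only one possible design. Keyword spotting, as distinguished in the lecture, targets search without requiring a complete transcription first, so the indexing step here should be read as an assumption of this sketch.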
Examples Mentioned
- Medieval manuscripts
- Papyri and engraved inscriptions
- Historical correspondence
- Abbey Library material and other heritage collections
Source transcript: transcripts/Course 1_Fischer_TextRecognition.txt
Further Reading
See the Zotero collection for five selected publications by this lecturer.