Converting your text base
As part of full-text indexing, the text contained in the digitised document is automatically recognised and converted into a machine-readable form. In addition, structural elements such as paragraphs, headings or highlights can be identified and displayed. In addition to the text itself, information about the internal structure of the document – outline, headings, paragraphs – and typographical highlights – italics or underlining – are also captured.
Advantages
- Quick content indexing
Full texts can be searched, e.g. by keywords, people or places – without manually leafing through digitised or printed sources. - Data collection for research projects
Text data can be extracted, filtered and prepared for further analysis, e.g. for visualisations, network analyses or annotations. - Archiving and long-term use
The digital format allows for the sustainable preservation and reusability of text sources regardless of their original formats. - Basis for digital editions
Structured texts can be used as a basis for editing or indexing projects. - Integration into repositories or platforms
Indexed texts can be integrated into existing digital infrastructures, e.g. for open access or subject-specific databases. - Accessibility
Digitally processed content, e.g. text-to-speech, makes it easier for people with visual impairments to access content.