Resources

This page showcases research tools I've built for digital philology, Slavic text processing, and corpus linguistics.

Digital Philology Tools

Dostoevsky Search available

A full-text search interface for the works of Fyodor Dostoevsky, supporting exact word forms, lemmatized queries, and multi-word phrases. Results include numbered occurrences, expandable context windows, and detailed statistical analysis (IPM, CV, Gini coefficient, collocations, and word forms), along with navigation across structural text divisions. The interface also includes a single-text search mode, allowing users to open a full work and navigate directly between occurrences within the text.

Search modesexact forms, lemmas, phrases
Results per page50 occurrences
Preview~1,500 characters
Extended context~30,000 characters
StatisticsIPM, CV, Gini, collocations (±3), word forms
Collocation filterstopwords (on/off)

Built with Flask and pymorphy2 for lemmatization and morphological analysis of Russian word forms. The system automatically detects text structure based on markup.

Reference edition: Достоевский, Ф. М. (1989–1996). Собрание сочинений в 15 томах. Наука, Ленинградское отделение.

Text Processing Tools

RUslav: Cyrillic Keyboard for Windowsavailable

A specialized phonemic Cyrillic keyboard layout designed for German Slavic Studies. It allows for seamless input of Cyrillic characters based on the standard German Latin (QWERTZ) layout. The system supports both modern Slavic languages and historical scripts, specifically tailored for Old Church Slavonic and Church Slavonic philological work.

DocxSectioner available

A desktop tool to split large Word documents into separate files based on heading styles. It is designed for managing thesis chapters, corpus segments, or long linguistic manuscripts.

The utility preserves all original formatting, tables, and images, making it a reliable solution for granular text analysis and modular editing of complex academic works.

DocxMelter available

A companion tool for merging multiple .docx files into a single master document. It automates the assembly of fragmented research papers or edited chapters into a unified monograph.

It ensures consistent section breaks and page numbering across the entire batch, providing a structurally sound final document for publication or submission.

Transliteration Tools

Online Transliterationavailable

A browser-based tool for scholarly transliteration of Cyrillic text into Latin script. The service follows the scientific transliteration tradition commonly used in Slavic linguistics and widely recognized in German-speaking academic contexts.

Supported languages include Russian, Ukrainian, Belarusian, Bulgarian, Serbian, Macedonian, and Old Church Slavonic. The interface transliterates only the relevant Cyrillic characters and leaves other scripts unchanged.

The editor preserves basic formatting such as bold, italics, line breaks, and letter case, which makes it useful for philological work, teaching materials, and quick preparation of publication-ready text.

Macros for Russian and Ukrainian Transliteration available

A Microsoft Word–based tool for transliterating Russian and Ukrainian text directly inside Word documents according to the scholarly transliteration convention widely used in German-language Slavic studies.

Distributed as a Word template with VBA macros, the tool is designed to preserve ordinary document formatting during transliteration, including character styling and paragraph layout.

This makes it suitable for revising existing documents, preparing handouts and teaching materials, and converting formatted text for academic use without moving it into a separate editor.

Contact

name.surname [at] ur.de