mikekestemont / ruzicka
☆12Updated last year
Alternatives and similar repositories for ruzicka:
Users that are interested in ruzicka are comparing it to the libraries listed below
- R package for stylometric analyses☆188Updated 2 months ago
- Perseus Treebank Data☆72Updated 9 months ago
- A set of utilities for processing MediaWiki XML dump data.☆52Updated last month
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Simplified version of a common crawl fetcher☆13Updated last week
- The curation repository for the data behind Concepticon.☆38Updated last month
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- JS / Python3 / PHP Lib to work with UTF8 polytonic greek and latin☆10Updated 6 months ago
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- Linguistic Reconstruction with LingPy☆13Updated 7 months ago
- The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to …☆31Updated 5 months ago
- Trained taggers, tokenizers, etc. for the CLTK☆9Updated 3 years ago
- jq module to process Wikidata JSON format☆11Updated 5 years ago
- High-performance text aligner for large collections of texts☆50Updated 5 months ago
- Z39.50/SRU router☆16Updated 2 weeks ago
- CollateX – Software for Collating Textual Sources☆92Updated last year
- Latin BERT☆60Updated 9 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆64Updated last week
- Coptic NLP pipeline page and utilities☆14Updated last month
- Python 3 library for accenting (and analyzing the accentuation of) Ancient Greek words☆56Updated 3 years ago
- In-browser OCR of Ancient Greek and Latin☆26Updated this week
- PhiloLogic4☆38Updated 3 months ago
- GLEM is a lemmatizer for Ancient Greek.☆24Updated last year
- Data from the Integrating Digital Papyrology project☆66Updated this week
- ☆58Updated last month
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 10 months ago
- An approximate nearest-neighbor search for text reuse.☆12Updated 4 years ago
- A multilingual parallel corpus created from translations of the Bible.☆178Updated 6 months ago