sebastian-nehrdich / sanskrit-tibetan-etextsLinks
This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.
☆15Updated 3 years ago
Alternatives and similar repositories for sanskrit-tibetan-etexts
Users that are interested in sanskrit-tibetan-etexts are comparing it to the libraries listed below
Sorting:
- High-performance text aligner for large collections of texts☆53Updated last week
- Morphological analyzer and lemmatizer for Latin.☆27Updated last week
- linguistics backend☆42Updated 2 years ago
- The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to …☆33Updated this week
- Linguistic search for large annotated text corpora, based on Apache Lucene☆116Updated this week
- Conversions between various OCR formats☆82Updated 2 years ago
- ☆32Updated 3 years ago
- CollateX – Software for Collating Textual Sources☆97Updated last year
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆21Updated 6 months ago
- Named entity annotation tool☆28Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 3 years ago
- Python tools for performing various operations on ALTO XML files☆48Updated 9 months ago
- An Ancient Greek Morphology Tagger☆26Updated 2 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆45Updated this week
- Arethusa: Annotation Environment☆36Updated 2 years ago
- A textual corpus database for the digital humanities.☆62Updated 5 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆58Updated 2 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆38Updated 2 years ago
- Detect and align similar passages☆111Updated 2 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- eXtensible Interlinear Glossed Text☆33Updated 3 years ago
- The CIS OCR PostCorrectionTool☆44Updated 3 years ago
- ☆13Updated 2 years ago
- Convert ALTO XML to plain text + minimal metadata☆17Updated last year
- The curation repository for the data behind Concepticon.☆40Updated last week
- Process, enhance and evaluate multiple OCR output.☆24Updated last year
- Ancient Greek language models for spaCy☆33Updated 8 months ago
- A module for Omeka S that provides an API for the Neatline 3 single page application☆17Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated 4 years ago