juditacs / wikt2dict
Wiktionary parser tool for many language editions.
☆54Updated 2 years ago
Alternatives and similar repositories for wikt2dict:
Users that are interested in wikt2dict are comparing it to the libraries listed below
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Transform TMX to text☆28Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 10 years ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 3 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 4 months ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- Python Finite-State Toolkit☆54Updated 2 months ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆28Updated 5 years ago
- English web corpus with 4M tokens and several annotation types☆26Updated last year
- ☆12Updated 9 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 11 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 10 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- The curation repository for the data behind Concepticon.☆38Updated this week
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- Java Wiktionary Library☆57Updated 2 years ago
- ☆23Updated 7 years ago
- Editor for aligned parallel texts (personal desktop application).☆19Updated 4 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Sentence aligner☆112Updated 3 years ago