fildpauz / vocab-listsLinks
A list of vocabulary lists
☆22Updated 5 years ago
Alternatives and similar repositories for vocab-lists
Users that are interested in vocab-lists are comparing it to the libraries listed below
Sorting:
- Sentence aligner☆120Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last week
- Improved Sentence Alignment in Linear Time and Space☆185Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 4 months ago
- ☆65Updated 2 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆32Updated 6 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆39Updated last year
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆84Updated 2 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆51Updated 2 years ago
- Morphological Dictionaries for German Language☆30Updated 7 years ago
- A modern, interlingual wordnet interface for Python☆272Updated this week
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- Efficient Low-Memory Aligner☆146Updated 10 months ago
- Bitextor generates translation memories from multilingual websites☆296Updated last year
- LingPy: Python library for quantitative tasks in historical linguistics☆138Updated 4 months ago
- Translation Memory Open-source Purifier☆34Updated 3 years ago
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- The Open Multilingual Wordnet☆65Updated last year
- Python Finite-State Toolkit☆60Updated last week
- Multilingual sentence alignment using sentence embeddings☆130Updated last year
- Lexical database for ~70k English words with morphological variables☆47Updated 3 years ago
- A character-wise tokenizer for morphologically rich languages☆29Updated last month
- A cloud-based, open-source system for writing and publishing dictionaries.☆96Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Updated last year
- ☆78Updated 3 months ago
- OpusFilter - Parallel corpus processing toolkit☆112Updated last week
- Helsinki Finite-State Technology (library and application suite)☆136Updated 3 weeks ago
- A Python Wiktionary Parser☆367Updated 3 months ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆28Updated 5 years ago