LBeaudoux / tatoebatools
A library for fetching and reading Tatoeba's weekly exports
☆22Updated last year
Alternatives and similar repositories for tatoebatools:
Users that are interested in tatoebatools are comparing it to the libraries listed below
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning.☆41Updated 5 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆61Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆98Updated last week
- IPA Pronunciation Dictionaries in DSL format☆39Updated 8 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- A jisho.org API made in Python☆76Updated 2 weeks ago
- Add sentences to Anki editor window in one click☆15Updated 8 months ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆50Updated last year
- Anki add-on to look up vocabulary using Wiktionary☆18Updated 2 weeks ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆71Updated 3 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆48Updated 4 months ago
- British English pronunciation dictionary☆92Updated 7 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Python Finite-State Toolkit☆53Updated 2 weeks ago
- Python API to access glottolog/glottolog☆29Updated 4 months ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 11 months ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆23Updated 4 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆153Updated 3 months ago
- Cython wrapper on Hunspell Dictionary☆67Updated 8 months ago
- Tools for creating DSL-format dictionaries☆14Updated 3 years ago
- German part-of-speech dictionary☆43Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆105Updated last month
- Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.☆12Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆75Updated 6 months ago