LBeaudoux / tatoebatoolsLinks
A library for fetching and reading Tatoeba's weekly exports
☆24Updated last year
Alternatives and similar repositories for tatoebatools
Users that are interested in tatoebatools are comparing it to the libraries listed below
Sorting:
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆105Updated last week
- A Python Wiktionary Parser☆362Updated 2 weeks ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆155Updated 7 months ago
- Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.☆782Updated last week
- Wiktionary dump file parser and multilingual data extractor☆964Updated last week
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 8 months ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆51Updated this week
- A list of vocabulary lists☆21Updated 5 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆76Updated 3 months ago
- A modern, interlingual wordnet interface for Python☆255Updated last month
- Open morphology for Finnish☆92Updated 3 months ago
- A jisho.org API made in Python☆84Updated 5 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆56Updated 10 years ago
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning.☆41Updated 5 years ago
- The World Atlas of Language Structures☆61Updated 9 months ago
- A half-automatic Forvo downloader addon for Anki☆19Updated 2 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆64Updated last week
- A Python library for working with and comparing language codes.☆345Updated 3 months ago
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆25Updated last year
- Complete Conjugation of any Verb(e) in Catalan, French, Italian, Portuguese, Romanian or Spanish and conjugate unknown verbs using Machin…☆90Updated last year
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Open source, updated Whitaker's Words Latin Dictionary and Morphology in Python☆56Updated 7 years ago
- Python 3 library for manipulating Jim Breen's JMdict, KanjiDic2, JMnedict and kanji-radical mappings☆143Updated 4 years ago
- 🍙 An Anki add-on that makes your images small.☆29Updated 4 months ago
- Anki add-on to look up vocabulary using Wiktionary☆20Updated 5 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆53Updated 4 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- ☆15Updated last week
- A MorphMan fork rebuilt from the ground up with a focus on simplicity, performance, and a codebase with minimal technical debt.☆92Updated this week