LBeaudoux / tatoebatoolsLinks
A library for fetching and reading Tatoeba's weekly exports
☆24Updated 2 years ago
Alternatives and similar repositories for tatoebatools
Users that are interested in tatoebatools are comparing it to the libraries listed below
Sorting:
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated 2 weeks ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆161Updated 11 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆65Updated last week
- A Python library to parse MediaWiki WikiText☆317Updated 6 months ago
- A Python Wiktionary Parser☆367Updated 4 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆180Updated 6 months ago
- Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and Romanian and can predict conjugation for u…☆97Updated 2 weeks ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆55Updated 4 months ago
- A modern, interlingual wordnet interface for Python☆276Updated this week
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning.☆41Updated 6 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Open source, updated Whitaker's Words Latin Dictionary and Morphology in Python☆59Updated 8 years ago
- Open morphology for Finnish☆95Updated last week
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆30Updated 2 years ago
- LingPy: Python library for quantitative tasks in historical linguistics☆138Updated last week
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆78Updated 7 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆97Updated last year
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆77Updated last week
- A Python library for working with and comparing language codes.☆353Updated 7 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆75Updated 3 months ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Collaborative data curation for Glottolog☆178Updated this week
- A jisho.org API made in Python☆89Updated 9 months ago
- The World Atlas of Language Structures☆72Updated last year
- Hy-phen-ation made easy☆217Updated 9 months ago
- A Python parser for MediaWiki wikicode☆849Updated 5 months ago
- universal syllabification algorithms☆45Updated 2 years ago