suyashb95 / WiktionaryParserLinks
A Python Wiktionary Parser
☆367Updated 3 months ago
Alternatives and similar repositories for WiktionaryParser
Users that are interested in WiktionaryParser are comparing it to the libraries listed below
Sorting:
- Wiktionary dump file parser and multilingual data extractor☆1,030Updated this week
- Gather modern English word frequencies from all enwiki articles.☆226Updated last year
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆159Updated 10 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last week
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 10 months ago
- A modern, interlingual wordnet interface for Python☆266Updated last month
- A cloud-based, open-source system for writing and publishing dictionaries.☆95Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆177Updated 4 months ago
- The Open English WordNet☆643Updated 3 weeks ago
- LingPy: Python library for quantitative tasks in historical linguistics☆137Updated 3 months ago
- A library for fetching and reading Tatoeba's weekly exports☆24Updated last year
- Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and Romanian and can predict conjugation for u…☆98Updated this week
- Sentence aligner☆118Updated 4 years ago
- Universal Dependencies online documentation☆287Updated this week
- A list of vocabulary lists☆22Updated 5 years ago
- A Python library to parse MediaWiki WikiText☆314Updated 5 months ago
- Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.☆807Updated last week
- Bitextor generates translation memories from multilingual websites☆296Updated 11 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆255Updated 2 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆61Updated 10 years ago
- Machine-readable Wiktionary☆77Updated last year
- Open morphology for Finnish☆96Updated 2 months ago
- A multilingual parallel corpus created from translations of the Bible.☆190Updated 5 months ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆75Updated last month
- A tokenizer and sentence splitter for German and English web and social media texts.☆148Updated 10 months ago
- Proxy to convert HTML responses from linguee.com to JSON format☆204Updated last year
- hand-written dictionaries from the FreeDict project☆443Updated 3 months ago
- A Python parser for MediaWiki wikicode☆838Updated 3 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆32Updated 6 years ago