suyashb95 / WiktionaryParser
A Python Wiktionary Parser
☆360Updated last year
Alternatives and similar repositories for WiktionaryParser:
Users that are interested in WiktionaryParser are comparing it to the libraries listed below
- Gather modern English word frequencies from all enwiki articles.☆207Updated 10 months ago
- A modern, interlingual wordnet interface for Python☆229Updated last month
- Wiktionary dump file parser and multilingual data extractor☆847Updated this week
- Sentence aligner☆109Updated 3 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- A python module for English lemmatization and inflection.☆265Updated last year
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆149Updated 2 weeks ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆70Updated last month
- Machine-readable Wiktionary☆74Updated 8 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated last month
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆234Updated 2 years ago
- Extract data from German Wiktionary XML files.☆26Updated 2 weeks ago
- Machine-Translation-based sentence alignment tool for parallel text☆304Updated 3 years ago
- hand-written dictionaries from the FreeDict project☆401Updated 3 months ago
- Multilingual sentence alignment using sentence embeddings☆106Updated 2 months ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- LingPy: Python library for quantitative tasks in historical linguistics☆128Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 4 months ago
- Universal Dependencies online documentation☆278Updated this week
- A library for fetching and reading Tatoeba's weekly exports☆21Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆137Updated last month
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 3 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆89Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆45Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆311Updated this week
- This packages up data for the Open Multilingual Wordnet☆44Updated 3 weeks ago