gambolputty / wiktionary-de-parserLinks
Extract data from German Wiktionary XML files.
☆26Updated 6 months ago
Alternatives and similar repositories for wiktionary-de-parser
Users that are interested in wiktionary-de-parser are comparing it to the libraries listed below
Sorting:
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆153Updated 6 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆104Updated last month
- A Python Wiktionary Parser☆361Updated 4 months ago
- German part-of-speech dictionary☆45Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆56Updated 10 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆167Updated last month
- Browser extension adding shortcuts to DWDS queries☆8Updated 6 months ago
- Tools for creating DSL-format dictionaries☆15Updated 3 years ago
- Tools for professional translators running GNU/Linux☆31Updated 3 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆76Updated 3 months ago
- Python scripting utility for SIL FieldWorks Language Explorer (FLEx)☆17Updated this week
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆25Updated last year
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 7 months ago
- A list of vocabulary lists☆21Updated 5 years ago
- Wiktionary dump file parser and multilingual data extractor☆950Updated this week
- 🏆 • 5050 most frequent words in 109 languages☆43Updated 2 years ago
- A library for fetching and reading Tatoeba's weekly exports☆24Updated last year
- Trained taggers, tokenizers, etc. for the CLTK☆10Updated 3 years ago
- Java Wiktionary Library☆57Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Complete Conjugation of any Verb(e) in Catalan, French, Italian, Portuguese, Romanian or Spanish and conjugate unknown verbs using Machin…☆90Updated last year
- Perseus Treebank Data☆72Updated last year
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated last year
- An OCR evaluation tool☆66Updated 2 months ago
- Anki add-on to look up vocabulary using Wiktionary☆19Updated 4 months ago
- Machine-readable Wiktionary☆76Updated last year
- Bitextor generates translation memories from multilingual websites☆294Updated 8 months ago
- A modern, interlingual wordnet interface for Python☆254Updated last week