gambolputty / wiktionary-de-parserLinks
Extract data from German Wiktionary XML files.
☆26Updated last week
Alternatives and similar repositories for wiktionary-de-parser
Users that are interested in wiktionary-de-parser are comparing it to the libraries listed below
Sorting:
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆160Updated 11 months ago
- German part-of-speech dictionary☆45Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆29Updated 2 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆61Updated 10 years ago
- A Python Wiktionary Parser☆367Updated 4 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated this week
- Tools for creating DSL-format dictionaries☆15Updated 3 years ago
- Java Wiktionary Library☆58Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated 11 months ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Offline etymological dictionary based on Wiktionary data☆22Updated 3 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 4 years ago
- Machine-readable Wiktionary☆77Updated last year
- Trained taggers, tokenizers, etc. for the CLTK☆10Updated 3 years ago
- Tools for professional translators running GNU/Linux☆32Updated 3 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆350Updated 3 years ago
- Morphological Dictionaries for German Language☆30Updated 7 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆78Updated 7 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆47Updated last year
- Lexical data at Unicode☆70Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆179Updated 5 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Python scripting utility for SIL FieldWorks Language Explorer (FLEx)☆18Updated 2 months ago
- A library for fetching and reading Tatoeba's weekly exports☆24Updated last year
- ☆28Updated last year
- Perseus Treebank Data☆75Updated last year
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- Python Library and CLI for the LanguageTool JSON API☆140Updated 7 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆96Updated last year