gambolputty / wiktionary-de-parserLinks
Extract data from German Wiktionary XML files.
☆26Updated 8 months ago
Alternatives and similar repositories for wiktionary-de-parser
Users that are interested in wiktionary-de-parser are comparing it to the libraries listed below
Sorting:
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆158Updated 8 months ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆51Updated last month
- A Python Wiktionary Parser☆363Updated 2 months ago
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆27Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- German part-of-speech dictionary☆45Updated 2 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆77Updated 5 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆57Updated 10 years ago
- Tools for creating DSL-format dictionaries☆15Updated 3 years ago
- Tools for professional translators running GNU/Linux☆32Updated 3 years ago
- Python scripting utility for SIL FieldWorks Language Explorer (FLEx)☆18Updated 2 weeks ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 9 months ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Java Wiktionary Library☆58Updated 2 years ago
- Trained taggers, tokenizers, etc. for the CLTK☆10Updated 3 years ago
- tesseractXplore a tesseract ease of use gui with full control☆24Updated 3 years ago
- Perseus Treebank Data☆73Updated last year
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- Ebook reader dictionaries extracted from Wiktionary in almost all languages, in Stardict, Tabfile and Kindle format☆110Updated 2 years ago
- Data from the Integrating Digital Papyrology project☆67Updated this week
- A repository of words in multiple languages sorted by their frequency☆12Updated 2 years ago
- hand-written dictionaries from the FreeDict project☆435Updated 2 months ago
- Wiktionary dump file parser and multilingual data extractor☆1,003Updated this week
- A library for fetching and reading Tatoeba's weekly exports☆24Updated last year
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Anki add-on to look up vocabulary using Wiktionary☆22Updated 6 months ago
- Libraries and command-line tools for metrical analysis of epic Greek hexameter☆28Updated 7 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆73Updated this week
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆17Updated 10 months ago