Wiktionary parser tool for many language editions.
☆54Aug 17, 2022Updated 3 years ago
Alternatives and similar repositories for wikt2dict
Users that are interested in wikt2dict are comparing it to the libraries listed below
Sorting:
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- Creates dictionary files from Wiktionary data☆30Aug 21, 2025Updated 6 months ago
- Machine-readable Wiktionary☆78May 6, 2024Updated last year
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Oct 19, 2019Updated 6 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35May 5, 2023Updated 2 years ago
- automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons☆22Apr 18, 2018Updated 7 years ago
- Gramadán: a computational grammar of Irish☆17Jan 23, 2023Updated 3 years ago
- A parser and autocorrection tool for wiktionary.☆39Dec 4, 2015Updated 10 years ago
- Scripts for preprocessing morfologik data.☆40Dec 2, 2017Updated 8 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated 2 weeks ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- Command-line corpus tools☆12May 15, 2017Updated 8 years ago
- X-SAMPA to IPA converter☆28Nov 6, 2020Updated 5 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆50Aug 16, 2023Updated 2 years ago
- A multilingual parallel corpus created from translations of the Bible.☆193May 19, 2025Updated 9 months ago
- Android app for learning genders of German nouns☆15Apr 14, 2023Updated 2 years ago
- Language checker and hyphenator extension for LibreOffice☆12Jan 27, 2020Updated 6 years ago
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf☆12Mar 22, 2015Updated 10 years ago
- Study on lexibank data (presenting the lexibank dataset).☆15Apr 11, 2025Updated 10 months ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆16Sep 20, 2023Updated 2 years ago
- Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.☆16Jul 22, 2013Updated 12 years ago
- Data from a corpus of written Hawaiian☆17Jun 27, 2016Updated 9 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Cross-Linguistic Transcription Systems☆17Dec 17, 2024Updated last year
- Extract data from German Wiktionary XML files.☆26Jan 8, 2026Updated last month
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆32Aug 16, 2023Updated 2 years ago
- ☆16Jan 20, 2022Updated 4 years ago
- Tools for creating DSL-format dictionaries☆15Feb 5, 2022Updated 4 years ago
- Python implementation of sinewave speech, as a command-line tool☆14May 30, 2020Updated 5 years ago
- Workshop bringing together individuals interested in developing curriculum, workflows, and tools to strengthen reproducibility in researc…☆33Jul 12, 2015Updated 10 years ago
- Bregman Labs’ Audiovisual Synthesis Tools☆16Mar 20, 2017Updated 8 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Apr 25, 2015Updated 10 years ago