bilingual dictionary extractor from parallel corpora
☆23Jul 3, 2014Updated 11 years ago
Alternatives and similar repositories for hundict
Users that are interested in hundict are comparing it to the libraries listed below
Sorting:
- Wiktionary parser tool for many language editions.☆54Aug 17, 2022Updated 3 years ago
- a sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models☆22Jan 18, 2016Updated 10 years ago
- Code for EMNLP 2016 paper "Equation Parsing : Mapping Sentences to Grounded Equations"☆12Jun 28, 2017Updated 8 years ago
- Open Source Neural Machine Translation in PyTorch☆13Apr 29, 2023Updated 2 years ago
- small python app to help practice speech shadowing, helpful for language learning☆13Jun 25, 2020Updated 5 years ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations☆15Jan 6, 2017Updated 9 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- Allows language communities to build their own dictionaries. Development is tracked at https://jira.sil.org/projects/WS☆19Jan 30, 2026Updated last month
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o…☆22Oct 29, 2017Updated 8 years ago
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated last month
- Microsoft Speech Language Translation (MSLT) Corpus☆19Sep 18, 2017Updated 8 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- ☆20Aug 17, 2021Updated 4 years ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Dec 20, 2022Updated 3 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆24Oct 13, 2023Updated 2 years ago
- ☆21Dec 9, 2016Updated 9 years ago
- Thot toolkit for statistical machine translation☆53Nov 11, 2022Updated 3 years ago
- automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons☆22Apr 18, 2018Updated 7 years ago
- Creates dictionary files from Wiktionary data☆30Aug 21, 2025Updated 6 months ago
- ☆29Dec 2, 2024Updated last year
- An app that graphs and compares the pitch contours of spoken language, to help language learners perfect their intonation (Hackbright Spr…☆30Jul 20, 2017Updated 8 years ago
- A multilingual lexical and semantic resource that links words of natural languages to abstract semantic concepts. Also called U++ Common …☆29Sep 25, 2025Updated 5 months ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆61Nov 21, 2025Updated 3 months ago
- AutoCorpus is a set of utilities that enable automatic extraction of language corpora and language models from publicly available dataset…☆37Feb 1, 2012Updated 14 years ago
- Search comments and highlights annotations in PDF documents.☆12May 4, 2023Updated 2 years ago
- ICU based universal language tokenizer☆34Jan 19, 2022Updated 4 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67May 9, 2022Updated 3 years ago
- ☆31Mar 7, 2017Updated 9 years ago
- Create PDFs (A4 format) for practicing Chinese character writing. Completely written in HTML, CSS, Javascript (with jQuery).☆41Apr 1, 2019Updated 6 years ago
- A library for extracting and parsing Wikipedia talk pages☆13Apr 20, 2017Updated 8 years ago
- Minangkabau NLP corpus. PACLIC 2020☆10Jun 7, 2021Updated 4 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35May 5, 2023Updated 2 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆12Aug 15, 2024Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- python script for voice activity detection.☆36Aug 16, 2024Updated last year
- Tools for working with the CMU Pronunciation Dictionary☆36Sep 5, 2017Updated 8 years ago
- Install Kubo (go-ipfs) from NPM☆44Updated this week