zseder / hundictView external linksLinks
bilingual dictionary extractor from parallel corpora
☆23Jul 3, 2014Updated 11 years ago
Alternatives and similar repositories for hundict
Users that are interested in hundict are comparing it to the libraries listed below
Sorting:
- Wiktionary parser tool for many language editions.☆54Aug 17, 2022Updated 3 years ago
- a sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models☆22Jan 18, 2016Updated 10 years ago
- Code for EMNLP 2016 paper "Equation Parsing : Mapping Sentences to Grounded Equations"☆12Jun 28, 2017Updated 8 years ago
- Open Source Neural Machine Translation in PyTorch☆13Apr 29, 2023Updated 2 years ago
- small python app to help practice speech shadowing, helpful for language learning☆12Jun 25, 2020Updated 5 years ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations☆15Jan 6, 2017Updated 9 years ago
- A radio for Wikimedia Commons audio files☆14Dec 28, 2020Updated 5 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- Social Media Machine Translation Toolkit☆21Sep 13, 2013Updated 12 years ago
- Allows language communities to build their own dictionaries. Development is tracked at https://jira.sil.org/projects/WS☆19Jan 30, 2026Updated 2 weeks ago
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o…☆22Oct 29, 2017Updated 8 years ago
- Microsoft Speech Language Translation (MSLT) Corpus☆19Sep 18, 2017Updated 8 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- ☆20Aug 17, 2021Updated 4 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆24Oct 13, 2023Updated 2 years ago
- ☆21Dec 9, 2016Updated 9 years ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Dec 20, 2022Updated 3 years ago
- Thot toolkit for statistical machine translation☆53Nov 11, 2022Updated 3 years ago
- automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons☆22Apr 18, 2018Updated 7 years ago
- ☆29Dec 2, 2024Updated last year
- A multilingual lexical and semantic resource that links words of natural languages to abstract semantic concepts. Also called U++ Common …☆29Sep 25, 2025Updated 4 months ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆61Nov 21, 2025Updated 2 months ago
- AutoCorpus is a set of utilities that enable automatic extraction of language corpora and language models from publicly available dataset…☆37Feb 1, 2012Updated 14 years ago
- Reader Translator Generator - NMT toolkit based on pytorch☆32Sep 12, 2023Updated 2 years ago
- Search comments and highlights annotations in PDF documents.☆12May 4, 2023Updated 2 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Nov 20, 2015Updated 10 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67May 9, 2022Updated 3 years ago
- Create PDFs (A4 format) for practicing Chinese character writing. Completely written in HTML, CSS, Javascript (with jQuery).☆40Apr 1, 2019Updated 6 years ago
- ☆31Mar 7, 2017Updated 8 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- Minangkabau NLP corpus. PACLIC 2020☆10Jun 7, 2021Updated 4 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆12Aug 15, 2024Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35May 5, 2023Updated 2 years ago
- python script for voice activity detection.☆36Aug 16, 2024Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- Tools for working with the CMU Pronunciation Dictionary☆36Sep 5, 2017Updated 8 years ago
- A JAX library for building lattice-based speech transducer models☆46Jan 8, 2026Updated last month
- Install Kubo (go-ipfs) from NPM☆44Updated this week
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆42Sep 6, 2025Updated 5 months ago