hainan-xv / zipporah
☆42 · Jul 17, 2018 · Updated 7 years ago
Alternatives and similar repositories for zipporah
Users that are interested in zipporah are comparing it to the libraries listed below
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 · Jun 18, 2024 · Updated last year
- Simple LSTM language modelling toolkit ☆10 · Oct 21, 2022 · Updated 3 years ago
- Tool for manual evaluation of parallel sentences. ☆15 · Jan 26, 2026 · Updated 3 weeks ago
- Data collection, alignment and TAUS repository ☆23 · Nov 30, 2017 · Updated 8 years ago
- XenC: open-source data selection tool for NLP ☆64 · Mar 21, 2016 · Updated 9 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages" ☆39 · Aug 7, 2018 · Updated 7 years ago
- Scripts to preprocess training and test data and to run fast_align and giza ☆107 · Nov 2, 2021 · Updated 4 years ago
- Efficient Low-Memory Aligner ☆146 · Jan 15, 2025 · Updated last year
- A handy dataset of noises for ASR ☆22 · May 29, 2019 · Updated 6 years ago
- Efficient Markov Chain word alignment ☆52 · Aug 1, 2021 · Updated 4 years ago
- Bitextor generates translation memories from multilingual websites ☆300 · Nov 11, 2024 · Updated last year
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN) ☆21 · Sep 27, 2017 · Updated 8 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context. ☆11 · May 17, 2017 · Updated 8 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks. ☆41 · Dec 19, 2023 · Updated 2 years ago
- ☆24 · Nov 29, 2017 · Updated 8 years ago
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago
- Appraise evaluation system for manual evaluation of machine translation output ☆77 · May 7, 2021 · Updated 4 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources ☆26 · May 28, 2019 · Updated 6 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages ☆11 · Feb 6, 2024 · Updated 2 years ago
- recent audio generation papers (including speech, music and general audios) ☆13 · Mar 14, 2023 · Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal ☆34 · Sep 4, 2025 · Updated 5 months ago
- OpusFilter - Parallel corpus processing toolkit ☆115 · Updated this week
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation" ☆11 · Jul 13, 2017 · Updated 8 years ago
- Chinese-ASR built on kaldi ☆14 · Jan 21, 2019 · Updated 7 years ago
- Pronunciation-assisted Subword Modeling ☆31 · May 30, 2019 · Updated 6 years ago
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments. ☆77 · Jul 9, 2021 · Updated 4 years ago
- paper notes on nlp/cv/rl/dl ☆14 · May 15, 2017 · Updated 8 years ago
- Text normalization scripts from IRISA lab ☆14 · Jun 1, 2018 · Updated 7 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train… ☆14 · Feb 15, 2021 · Updated 5 years ago
- A GPU language model, based on btree backed tries. ☆29 · Mar 6, 2018 · Updated 7 years ago
- scripts used for SMT system submitted to WMT 2014 ☆12 · Apr 30, 2017 · Updated 8 years ago
- CS224S Course Project ☆14 · Jun 9, 2014 · Updated 11 years ago
- Script for converting kaldi GMM/HMM models to HTK format ☆11 · Jul 18, 2024 · Updated last year
- Machine-Translation-based sentence alignment tool for parallel text ☆314 · Mar 18, 2021 · Updated 4 years ago
- A fast LSTM Language Model for large vocabulary language like Japanese and Chinese ☆111 · Jun 4, 2019 · Updated 6 years ago
- A library for data streaming and augmentation ☆21 · May 5, 2025 · Updated 9 months ago
- Full Stack of Latvian Language Resources for Natural Language Understanding (NLU) and Generation (NLG) ☆16 · Oct 20, 2022 · Updated 3 years ago
- Kroman is a Korean hangul romanization tool. ☆32 · Nov 30, 2016 · Updated 9 years ago