A large parallel corpus of English and Japanese
☆87Nov 1, 2017Updated 8 years ago
Alternatives and similar repositories for JESC
Users that are interested in JESC are comparing it to the libraries listed below
Sorting:
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆105Apr 29, 2021Updated 4 years ago
- 50k English-Japanese Parallel Corpus for Machine Translation Benchmark.☆98Sep 11, 2019Updated 6 years ago
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- ☆22Dec 20, 2019Updated 6 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- Decoding platform for machine translation research☆54Aug 24, 2019Updated 6 years ago
- ☆65Feb 28, 2021Updated 5 years ago
- NMT for chinese-english using tensor2tensor☆47Jan 15, 2018Updated 8 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆40Jul 14, 2020Updated 5 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Bitextor generates translation memories from multilingual websites☆301Nov 11, 2024Updated last year
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- ☆22Feb 3, 2026Updated last month
- Practical example from Human-in-the-Loop Machine Learning book☆11Oct 28, 2021Updated 4 years ago
- Data collection, alignment and TAUS repository☆23Nov 30, 2017Updated 8 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Jul 25, 2024Updated last year
- ☆30May 20, 2022Updated 3 years ago
- A Neural Machine Translation implementation in Chainer☆46May 22, 2020Updated 5 years ago
- Korean Parallel Corpus☆147Feb 24, 2024Updated 2 years ago
- The code used fine-tuning of BERT(Transformer Neural Network Architecture)to accurately pick the correct answer among ten choices that be…☆12Dec 8, 2019Updated 6 years ago
- AMI Meeting Parallel Corpus☆11Dec 11, 2020Updated 5 years ago
- paper notes on nlp/cv/rl/dl☆14May 15, 2017Updated 8 years ago
- CaboCha wrapper for Python3☆46Jul 5, 2018Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- ☆15Nov 5, 2020Updated 5 years ago
- small python app to help practice speech shadowing, helpful for language learning☆13Jun 25, 2020Updated 5 years ago
- Efficient Markov Chain word alignment☆53Aug 1, 2021Updated 4 years ago
- Kyoto University Web Document Leads Corpus☆83Dec 18, 2023Updated 2 years ago
- BERT with SentencePiece for Japanese text.☆33Oct 28, 2021Updated 4 years ago
- ☆12Dec 9, 2015Updated 10 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17May 18, 2023Updated 2 years ago
- Wikipediaから作成した日本語名寄せデータセット☆35Mar 10, 2020Updated 5 years ago
- Deep learning model of machine translation using attentional and structural biases☆13Jul 21, 2017Updated 8 years ago
- My implementation of LASER architecture in Fairseq☆12Oct 6, 2020Updated 5 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆37May 1, 2025Updated 10 months ago
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources☆17Apr 16, 2020Updated 5 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆18Mar 15, 2021Updated 4 years ago