A large parallel corpus of English and Japanese
☆90Nov 1, 2017Updated 8 years ago
Alternatives and similar repositories for JESC
Users that are interested in JESC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆105Apr 29, 2021Updated 5 years ago
- 50k English-Japanese Parallel Corpus for Machine Translation Benchmark.☆98Sep 11, 2019Updated 6 years ago
- ☆22Dec 20, 2019Updated 6 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jul 30, 2025Updated 9 months ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- ☆16Aug 20, 2020Updated 5 years ago
- Decoding platform for machine translation research☆54Aug 24, 2019Updated 6 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- NMT for chinese-english using tensor2tensor☆47Jan 15, 2018Updated 8 years ago
- Bitextor generates translation memories from multilingual websites☆299Nov 11, 2024Updated last year
- ☆63Feb 28, 2021Updated 5 years ago
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- ☆30May 20, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆24Nov 29, 2017Updated 8 years ago
- Efficient Markov Chain word alignment☆53Aug 1, 2021Updated 4 years ago
- 日本語でのMaven入門☆47Dec 8, 2025Updated 5 months ago
- Korean Parallel Corpus☆147Feb 24, 2024Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Jul 25, 2024Updated last year
- Neural macine translation soft alignment visualisations for web and command line☆73Aug 19, 2021Updated 4 years ago
- Kyoto University Web Document Leads Corpus☆84Dec 18, 2023Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆40Jul 14, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆22Oct 26, 2020Updated 5 years ago
- ☆24Apr 19, 2026Updated last month
- Yet another sentence-level tokenizer for the Japanese text☆24Nov 27, 2025Updated 6 months ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- 敬語変換タスクにおける評価用データセット☆21Nov 24, 2022Updated 3 years ago
- ☆15Nov 5, 2020Updated 5 years ago
- CaboCha wrapper for Python3☆46Jul 5, 2018Updated 7 years ago
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT☆26Jan 27, 2021Updated 5 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,242Jan 12, 2026Updated 4 months ago
- Data collection, alignment and TAUS repository☆24Nov 30, 2017Updated 8 years ago
- lists of text corpus and more (mainly Japanese)☆119Jul 25, 2024Updated last year
- AMI Meeting Parallel Corpus☆12Dec 11, 2020Updated 5 years ago
- COrpus based Morphological Analyzer with INtegrated User dictionary☆21Mar 30, 2025Updated last year
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆39May 1, 2025Updated last year
- Calorie counter for the command-line with 8,000 food items (USDA)☆11Sep 21, 2017Updated 8 years ago