openlanguagedata / seed
Seed Machine Translation Data
☆33 Nov 12, 2024 Updated last year
Alternatives and similar repositories for seed
Users who are interested in seed are comparing it to the repositories listed below.
- The FLORES+ Machine Translation Benchmark ☆110 Nov 12, 2024 Updated last year
- A native Chinese, graded benchmark for code capability testing ☆15 Apr 11, 2024 Updated last year
- ☆14 Oct 4, 2024 Updated last year
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations ☆15 Apr 14, 2025 Updated 9 months ago
- ☆81 Jan 30, 2026 Updated 2 weeks ago
- sqlite3 fts5 mecab ☆22 Aug 9, 2019 Updated 6 years ago
- ☆20 Oct 22, 2021 Updated 4 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆28 Feb 8, 2023 Updated 3 years ago
- AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain ☆23 May 31, 2022 Updated 3 years ago
- PolYamoR is the first forward-reverse automated translation system between Python and R ☆16 Mar 31, 2017 Updated 8 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆295 Feb 5, 2026 Updated last week
- Facebook Low Resource (FLoRes) MT Benchmark ☆762 Nov 20, 2023 Updated 2 years ago
- UD_Persian ☆32 Nov 12, 2025 Updated 3 months ago
- The Open Parallel Corpus ☆84 Jan 13, 2026 Updated last month
- Award-winning solution for the iFLYTEK Low-Resource Multilingual Text Translation Challenge ☆27 Sep 19, 2023 Updated 2 years ago
- NTREX -- News Test References for MT Evaluation ☆88 Jun 5, 2024 Updated last year
- Kontur Platform API Gateway ☆11 Aug 27, 2025 Updated 5 months ago
- ☆14 Jan 22, 2026 Updated 3 weeks ago
- tooling for vectorizing the planet ☆26 Feb 17, 2025 Updated 11 months ago
- ☆13 Feb 5, 2026 Updated last week
- Morphometric taxonomy of Central Europe ☆35 Feb 3, 2026 Updated last week
- How to support teams building Digital Public Goods? ☆10 May 5, 2022 Updated 3 years ago
- Wikimedia Enterprise - client SDK in Python ☆20 Nov 11, 2025 Updated 3 months ago
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone. ☆17 Jan 5, 2026 Updated last month
- Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android/iOS ready. High-qual… ☆18 Feb 6, 2026 Updated last week
- ☆24 Updated this week
- ☆13 Jun 17, 2025 Updated 7 months ago
- ☆10 Feb 18, 2025 Updated 11 months ago
- The Risk Modeller’s Toolkit prototype code. ☆11 May 7, 2021 Updated 4 years ago
- Simple, enhanced, and easy-to-use database made with better-sqlite3! ☆11 Jun 1, 2023 Updated 2 years ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment… ☆11 Apr 29, 2024 Updated last year
- ☆32 Sep 12, 2022 Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 Jun 18, 2024 Updated last year
- ☆12 Nov 14, 2024 Updated last year
- Some quick exploration of how k-means auto-encoders work ☆11 May 11, 2017 Updated 8 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022) ☆12 Oct 18, 2022 Updated 3 years ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining ☆13 Oct 22, 2021 Updated 4 years ago
- Supporting code for "Parallel Streaming Wasserstein Barycenters" ☆10 Nov 14, 2017 Updated 8 years ago
- This project provides XLNet pre-trained models for Chinese, aiming to enrich Chinese natural language processing resources and offer a diverse selection of Chinese pre-trained models. We welcome experts and scholars to download and use them, and to jointly promote the development of Chinese-language resources. ☆11 May 30, 2023 Updated 2 years ago