Seed Machine Translation Data
☆34Nov 12, 2024Updated last year
Alternatives and similar repositories for seed
Users who are interested in seed are comparing it to the libraries listed below.
- The FLORES+ Machine Translation Benchmark☆112Nov 12, 2024Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆30Feb 8, 2023Updated 3 years ago
- ☆15Oct 4, 2024Updated last year
- ☆82Jan 30, 2026Updated 3 months ago
- ☆20Oct 22, 2021Updated 4 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23May 31, 2022Updated 3 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆11Nov 2, 2015Updated 10 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆306Updated this week
- The Open Parallel Corpus☆88May 5, 2026Updated 3 weeks ago
- BLEU Score in Rust☆12May 20, 2026Updated last week
- Bindings to BLAS (Fortran)☆12May 28, 2025Updated last year
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- ☆18Nov 25, 2022Updated 3 years ago
- Computational Use of Data Agreement - Removing Barriers to Data Innovation☆21Jun 12, 2023Updated 2 years ago
- Code for ACL 2022 paper 'Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation'☆12Jun 7, 2024Updated last year
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆768Nov 20, 2023Updated 2 years ago
- A UWP client for the ASRT speech recognition system☆12Oct 23, 2019Updated 6 years ago
- Training scripts for Argos Translate☆156Jan 18, 2026Updated 4 months ago
- A Flex/Bison Parser for Blazonry - A Mediaeval Graphical Description Language☆14Apr 23, 2021Updated 5 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 6 months ago
- A simple, efficient algorithm for converting Gregorian (solar) calendar dates to lunar calendar dates☆10Feb 5, 2024Updated 2 years ago
- text classification using ELMO☆16Dec 8, 2018Updated 7 years ago
- Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text…☆16Mar 29, 2023Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆87Jun 5, 2024Updated last year
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆67Oct 25, 2024Updated last year
- ☆27Apr 14, 2025Updated last year
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Sep 19, 2022Updated 3 years ago
- A place to host demos for custom actions.☆14Feb 2, 2022Updated 4 years ago
- ☆59Dec 6, 2024Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 7 months ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Wikimedia Enterprise - client SDK in Python☆21May 4, 2026Updated 3 weeks ago
- ☆17Nov 23, 2021Updated 4 years ago
- simple kv store for streams☆36Mar 14, 2013Updated 13 years ago
- A tutorial on how to build your own Neural Language Model☆10Dec 8, 2022Updated 3 years ago