Tool for manual evaluation of parallel sentences.
☆15Jan 26, 2026Updated last month
Alternatives and similar repositories for keops
Users that are interested in keops are comparing it to the libraries listed below
Sorting:
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 5 months ago
- Corset is a web-based data selection portal that helps you getting relevant data from massive amounts of parallel data.☆20Nov 6, 2023Updated 2 years ago
- Transform TMX to text☆28Nov 23, 2022Updated 3 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- ☆42Jul 17, 2018Updated 7 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- ☆22Dec 20, 2019Updated 6 years ago
- Data collection, alignment and TAUS repository☆23Nov 30, 2017Updated 8 years ago
- Program used to split text into segments☆28Oct 27, 2024Updated last year
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 5 months ago
- ☆10Feb 2, 2021Updated 5 years ago
- Demo of pushState and popstate functionality☆10Aug 16, 2017Updated 8 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Corpus preprocessing☆100Mar 16, 2024Updated last year
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ☆14May 14, 2019Updated 6 years ago
- Find a mentor☆12Aug 12, 2022Updated 3 years ago
- Online webapp that scrapes news from different new portals of Nepal and worldwide. Hosted at heroku.☆11Dec 22, 2025Updated 2 months ago
- Code for "Imitation Attacks and Defenses for Black-box Machine Translations Systems"☆35May 1, 2020Updated 5 years ago
- ☆10Dec 12, 2022Updated 3 years ago
- Python bindings for the Unitex/GramLab corpus processor☆10Nov 25, 2022Updated 3 years ago
- Efficient teacher-student models and scripts to make them☆54Dec 16, 2023Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- This is a modified version of the AM29F016 or AM29F032 flash memory adapter board to easily DIY a Game Boy flash cartridge from J.Rodrigo…☆12Jun 20, 2022Updated 3 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context.☆11May 17, 2017Updated 8 years ago
- A simple python webscraper for jumia☆11Jul 14, 2024Updated last year
- "The purest form of giving is from anonymous to anonymous" - Jay Z☆10Jan 6, 2021Updated 5 years ago
- laravel package to implement database transaction with ease☆14Jan 12, 2026Updated last month
- TAUS Dynamic Quality Framework API☆12Sep 17, 2020Updated 5 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- This packages sets the production configuration automatically for laravel projects for the listed domains☆10Feb 28, 2020Updated 6 years ago
- statically generated weekly digest of articles read in Pocket☆10May 14, 2019Updated 6 years ago
- 🎮Backup, restore save, dump ROM, and program Flashcarts via GBA link port☆21Sep 17, 2025Updated 5 months ago
- ☆13Jan 14, 2021Updated 5 years ago
- documentation for things like relations and parts of speech used by wordnets☆13Jun 18, 2024Updated last year
- Dockerized NMT frameworks for nmt-wizard☆39Apr 18, 2023Updated 2 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆39Aug 7, 2018Updated 7 years ago
- Resource bank for everyone in DevRel 🥑☆13Mar 4, 2024Updated last year
- Papers We Chennai☆11Dec 16, 2020Updated 5 years ago