Corset is a web-based data selection portal that helps you get relevant data from massive amounts of parallel data.
☆21Nov 6, 2023Updated 2 years ago
Alternatives and similar repositories for corset
Users that are interested in corset are comparing it to the libraries listed below.
- Transform TMX to text☆27Nov 23, 2022Updated 3 years ago
- ☆13Aug 23, 2024Updated last year
- Program used to split text into segments☆28Oct 27, 2024Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆35Sep 4, 2025Updated 9 months ago
- Targeted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 9 months ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆16Mar 25, 2024Updated 2 years ago
- Auto-Encoding Variational Neural Machine Translation☆16Jan 22, 2020Updated 6 years ago
- Unofficial implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, using Flax with the Linen API☆13Sep 25, 2021Updated 4 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 4 months ago
- ☆15Nov 5, 2020Updated 5 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)☆16Aug 25, 2021Updated 4 years ago
- Nix source☆14Nov 21, 2020Updated 5 years ago
- ☆82Jan 30, 2026Updated 4 months ago
- Fast stand-alone C++ decoder for RNN-based NMT models☆31Dec 12, 2020Updated 5 years ago
- A Workflow Manager in Python☆50May 29, 2026Updated 2 weeks ago
- Unit testing for Nix code using Lix☆16Sep 15, 2025Updated 9 months ago
- A go pipeline management library, supporting concurrent pipelines, with multiple nodes and joints☆15Mar 24, 2026Updated 2 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Experiment in Nix formatting☆23Oct 4, 2019Updated 6 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆81Apr 8, 2023Updated 3 years ago
- GitHub Actions for automatically generating a personal awesome list from all of the repositories you starred.☆16Mar 6, 2023Updated 3 years ago
- Mosaic trees management tool and library☆14Nov 17, 2015Updated 10 years ago
- ☆11Nov 28, 2025Updated 6 months ago
- A booklet on the Smacc compiler compiler framework☆15Jun 6, 2026Updated last week
- Chrome based DID wallet for Ceramic Network | ETHOnline Winner☆11Nov 21, 2020Updated 5 years ago
- ☆10Apr 22, 2022Updated 4 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- This is a modified version of the AM29F016 or AM29F032 flash memory adapter board to easily DIY a Game Boy flash cartridge from J.Rodrigo…☆12Jun 20, 2022Updated 3 years ago
- ☆14Updated this week
- gevent.core implemented as cffi module, might be used with pypy☆56Apr 30, 2014Updated 12 years ago
- A token swapping interface app.☆12May 8, 2022Updated 4 years ago
- Parser and code browser for the ActiveOberon language (original version from 2004)☆29May 9, 2026Updated last month
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- RIBES is an automatic evaluation metric for machine translation.☆13Sep 7, 2017Updated 8 years ago
- Reverse-engineered tradingview-scan API for scanning cryptocurrencies (trendlines, oscillators, performance)☆17Mar 6, 2019Updated 7 years ago
- Module to control FFMPEG actions (TRIM, CLIPS, CONCAT, MERGE)☆10Jan 4, 2023Updated 3 years ago
- DEPAY/ETH liquidity staking smart contract☆10Aug 24, 2021Updated 4 years ago