The Open Parallel Corpus
☆85Jan 13, 2026Updated last month
Alternatives and similar repositories for OPUS
Users that are interested in OPUS are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115Feb 11, 2026Updated 2 weeks ago
- ☆82Jan 30, 2026Updated last month
- ☆14Oct 6, 2025Updated 4 months ago
- Seed Machine Translation Data☆33Nov 12, 2024Updated last year
- ☆20Dec 16, 2024Updated last year
- Curriculum training☆22Jun 25, 2025Updated 8 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Aug 28, 2023Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆25Nov 27, 2021Updated 4 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- Open neural machine translation models and web services☆773Feb 23, 2026Updated last week
- This utility loads previously downloaded Common Data Model (CDM) SAS data sets into an on-prem 3rd party database.☆13Jan 5, 2026Updated last month
- Translation demonstrator☆37May 12, 2020Updated 5 years ago
- Documentation and tutorials worth sharing.☆10Dec 7, 2022Updated 3 years ago
- ☆10Sep 29, 2022Updated 3 years ago
- NLP tools for Kazakh language☆35Apr 5, 2022Updated 3 years ago
- Jdt2Famix takes Java sources and produces MSE files that can be imported into Glamorous Toolkit.☆37Dec 14, 2021Updated 4 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- Prime number library.☆12Sep 19, 2022Updated 3 years ago
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 2 months ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- A web application for studying Ancient Greek texts with integrated lexical, syntactic, and morphological analysis tools.☆20Dec 1, 2025Updated 3 months ago
- Transform Oracle PL/SQL Code to Python☆11Oct 26, 2013Updated 12 years ago
- ☆12Jan 11, 2026Updated last month
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Sep 6, 2025Updated 5 months ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- super small gpt implementation☆16Dec 15, 2024Updated last year
- ☆11Nov 30, 2025Updated 3 months ago
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- A method for training neural networks that are provably robust to adversarial attacks. [IJCAI 2019]☆10Sep 3, 2019Updated 6 years ago
- Geometry Utility Functions☆12Nov 28, 2016Updated 9 years ago
- A specially crafted IOCTL can be issued to the rzpnk.sys driver in Razer Synapse 2.20.15.1104 that is forwarded to ZwOpenProcess allowing…☆14Nov 8, 2020Updated 5 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆175Dec 28, 2025Updated 2 months ago
- Using Demucs in comfyUI, make Music Source Separation☆10Dec 12, 2025Updated 2 months ago
- A complete serverless quiz application showcasing the entire LocalStack platform. Demonstrates Cloud Pods, Chaos Engineering, IAM Policy …☆17Oct 19, 2025Updated 4 months ago