Bicleaner fork that uses neural networks
☆40Feb 23, 2026Updated 3 months ago
Alternatives and similar repositories for bicleaner-ai
Users that are interested in bicleaner-ai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient teacher-student models and scripts to make them☆57Dec 16, 2023Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- ☆13Aug 23, 2024Updated last year
- ☆38Mar 16, 2026Updated 3 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- ☆143Apr 8, 2026Updated 2 months ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆397Nov 7, 2023Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Jun 3, 2026Updated last week
- ☆15Jun 17, 2019Updated 6 years ago
- Open language modeling toolkit based on PyTorch☆188Updated this week
- A tool that locates, downloads, and extracts machine translation corpora☆165Apr 13, 2026Updated 2 months ago
- ☆13Jul 31, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆28Jul 30, 2024Updated last year
- The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation…☆26Sep 10, 2025Updated 9 months ago
- This is the repo that hosts the code for Mozilla's translation service☆32Feb 12, 2024Updated 2 years ago
- ☆24Apr 2, 2024Updated 2 years ago
- ☆33Nov 22, 2021Updated 4 years ago
- ☆14May 26, 2023Updated 3 years ago
- Translation quality evaluation for Firefox Translations models☆12Oct 23, 2023Updated 2 years ago
- UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language☆13Jan 6, 2026Updated 5 months ago
- A Neural Framework for MT Evaluation☆761Apr 21, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated last year
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆32Updated this week
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆38Aug 29, 2025Updated 9 months ago
- Multilingual Entity Linking model by BELA model☆12Jul 20, 2023Updated 2 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆15Mar 24, 2021Updated 5 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project…☆18Jun 1, 2021Updated 5 years ago
- Bitextor generates translation memories from multilingual websites☆299Nov 11, 2024Updated last year
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CSE201 Objected-Oriented Programming in C++: Teach an AI to produce pieces of music☆12Jan 23, 2019Updated 7 years ago
- ☆20Oct 22, 2021Updated 4 years ago
- MediaWiki Categories Model☆13Feb 14, 2024Updated 2 years ago
- ☆14Jan 4, 2021Updated 5 years ago
- Loopback web application for administration of Datawake networks☆10May 2, 2017Updated 9 years ago
- Aldebaran is a cross-platform (Discord and Revolt) multi-purposes bot which offers useful features to DiscordRPG players along with many …☆11Jan 9, 2024Updated 2 years ago
- Bilingual sentence similarity classifier using Tensorflow☆24Sep 26, 2019Updated 6 years ago