bitextor / bicleaner-aiView external linksLinks
Bicleaner fork that uses neural networks
☆40Jan 29, 2026Updated 2 weeks ago
Alternatives and similar repositories for bicleaner-ai
Users that are interested in bicleaner-ai are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- Efficient teacher-student models and scripts to make them☆54Dec 16, 2023Updated 2 years ago
- ☆34Nov 15, 2023Updated 2 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆57Feb 3, 2026Updated last week
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Mar 24, 2021Updated 4 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆386Nov 7, 2023Updated 2 years ago
- ☆13Jul 31, 2023Updated 2 years ago
- ☆133Jan 22, 2026Updated 3 weeks ago
- 🔮 LLM GPU Calculator☆21Aug 19, 2023Updated 2 years ago
- Open language modeling toolkit based on PyTorch☆174Feb 4, 2026Updated last week
- A neural word aligner based on multilingual BERT☆370Mar 10, 2022Updated 3 years ago
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆30Updated this week
- ☆21Feb 13, 2023Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 4 months ago
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT☆27Jan 27, 2021Updated 5 years ago
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 5 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Apr 1, 2025Updated 10 months ago
- Open information and community for machine translation☆81Updated this week
- Bitextor generates translation memories from multilingual websites☆300Nov 11, 2024Updated last year
- Alternative implementation of the coreference scorer for the CoNLL-2011/2012 shared tasks on coreference resolution☆11Apr 29, 2021Updated 4 years ago
- Efficient Low-Memory Aligner☆146Jan 15, 2025Updated last year
- Terminal tool that converts files encoding to UTF-8☆10Oct 5, 2019Updated 6 years ago
- This repository defines a python class that can be used to load data for the tf.keras.model.fit_generator function by using a torch.utils…☆11Oct 26, 2024Updated last year
- UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language☆13Jan 6, 2026Updated last month
- some useless python stuff☆11Jul 30, 2020Updated 5 years ago
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- Répertoire officiel de changetondns.fr☆11Sep 16, 2024Updated last year
- Examples using Sonauto's generative music API☆10Mar 3, 2025Updated 11 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Engineering in Computer Science Master Degree Notes☆11Mar 4, 2023Updated 2 years ago
- ☆11Jul 6, 2023Updated 2 years ago
- Telegram bot framework written in PHP for OpenWRT☆12Nov 27, 2022Updated 3 years ago
- JSON Lines streaming serializer/deserializer on .NET and ASP.NET Core.☆14Nov 18, 2024Updated last year
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated last year
- Code and dataset for tracing semantic changes in Russian adjectives☆12Nov 26, 2019Updated 6 years ago
- ☆10May 28, 2022Updated 3 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆24Aug 2, 2025Updated 6 months ago
- ☆10Dec 22, 2023Updated 2 years ago