Efficient teacher-student models and scripts to make them
☆57Dec 16, 2023Updated 2 years ago
Alternatives and similar repositories for students
Users that are interested in students are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 2 months ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Apr 8, 2026Updated 3 weeks ago
- ☆34Nov 22, 2021Updated 4 years ago
- int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991☆75Dec 30, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TranslateLocally for the Browser is a web-extension that enables client side in-page translations for web browsers.☆89Apr 2, 2025Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jul 30, 2025Updated 9 months ago
- Marian Translation Service☆24Feb 12, 2021Updated 5 years ago
- Dockerized NMT frameworks for nmt-wizard☆39Apr 18, 2023Updated 3 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Feb 1, 2020Updated 6 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Fast Neural Machine Translation in C++ - development repository☆23May 12, 2024Updated last year
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- Customizable machine translation in C++☆56Apr 10, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆13Dec 11, 2020Updated 5 years ago
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod…☆11Mar 10, 2021Updated 5 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆75Apr 1, 2025Updated last year
- Improved Sentence Alignment in Linear Time and Space☆194Mar 6, 2023Updated 3 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆393Nov 7, 2023Updated 2 years ago
- Translation quality evaluation for Firefox Translations models☆12Oct 23, 2023Updated 2 years ago
- ☆20Jun 14, 2019Updated 6 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆233Jun 23, 2022Updated 3 years ago
- Bilingual term extractor☆59Nov 19, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tool to fix bitexts and tag near-duplicates for removal☆35Sep 4, 2025Updated 7 months ago
- Interactive Neural Machine Translation-lite (INMT-lite) is a framework to train and develop lite versions (.tflite) of models for neural …☆50Dec 19, 2025Updated 4 months ago
- Code and resources for evaluating cross-lingual embedding spaces☆29Apr 7, 2020Updated 6 years ago
- ☆21May 30, 2022Updated 3 years ago
- Bitextor generates translation memories from multilingual websites☆299Nov 11, 2024Updated last year
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 5 months ago
- Backtranslations of IMDB movie reviews for Data Augmentation Purposes☆10Apr 1, 2019Updated 7 years ago
- Standalone pre-training recipe with JAX+Flax☆35Apr 3, 2023Updated 3 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Mar 16, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for the collection and analysis of the MTNT dataset☆56Apr 2, 2019Updated 7 years ago
- Fast and secure translation on your local machine, powered by marian and Bergamot.☆602Mar 30, 2025Updated last year
- Fast Neural Machine Translation in C++☆1,441Aug 25, 2023Updated 2 years ago
- Examples using Sonauto's generative music API☆15Mar 3, 2025Updated last year
- ☆26Jul 30, 2024Updated last year
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆18Mar 15, 2021Updated 5 years ago
- Efficient Markov Chain word alignment☆53Aug 1, 2021Updated 4 years ago