Library for pruning experts per language pair in NLLB-200
☆34Jul 7, 2023Updated 2 years ago
Alternatives and similar repositories for nllb-pruning
Users that are interested in nllb-pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last month
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- ☆18Nov 5, 2025Updated 5 months ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21May 30, 2022Updated 3 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆95Oct 30, 2024Updated last year
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 5 months ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- ☆23Dec 11, 2024Updated last year
- Paradigms of Armenian conjugation classes, and sample verb list☆16Apr 13, 2022Updated 4 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆10Jun 27, 2022Updated 3 years ago
- [ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages☆106Updated this week
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Dec 8, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆29Nov 14, 2025Updated 5 months ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 3 years ago
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 2 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆10Aug 31, 2023Updated 2 years ago
- A rule-based machine translation system from Ottoman Turkish to Modern Turkish.☆23Jul 8, 2020Updated 5 years ago
- ☆51Jul 25, 2024Updated last year
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- Experiments for XLM-V Transformers Integeration☆13Feb 8, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆43Oct 13, 2022Updated 3 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆11Apr 2, 2024Updated 2 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T…☆13Dec 22, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- ☆43Apr 20, 2023Updated 2 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 8 months ago
- Torchreid-Pip: Packaged version of Torchreid☆13Oct 16, 2022Updated 3 years ago
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- 🧠 Workshop Notebook and assets for the Anthropic Hackathon☆12Nov 4, 2023Updated 2 years ago