Library for pruning experts per language pair in NLLB-200
☆34Jul 7, 2023Updated 2 years ago
Alternatives and similar repositories for nllb-pruning
Users that are interested in nllb-pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- COMET for African languages☆11Jan 24, 2025Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated 3 weeks ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- ☆18Nov 5, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- ☆21May 30, 2022Updated 3 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆96Oct 30, 2024Updated last year
- A bilingual dataset for image captioning☆19Oct 28, 2020Updated 5 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 4 months ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- Benchmark API for Multidomain Language Modeling☆25Aug 26, 2022Updated 3 years ago
- ☆22Dec 11, 2024Updated last year
- ☆56Jul 16, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- This is the reporsitory of Turkish fake news dataset which consists of Zaytung posts and Hurriyet news articles.☆15Jun 12, 2019Updated 6 years ago
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Dec 8, 2023Updated 2 years ago
- ☆30Nov 14, 2025Updated 4 months ago
- Meta's "No Language Left Behind" models served as web app and REST API☆259May 27, 2025Updated 10 months ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 2 years ago
- This is a demo project showing how to fine-tune and deploy the Whisper model on SageMaker.☆25Dec 20, 2023Updated 2 years ago
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 2 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A rule-based machine translation system from Ottoman Turkish to Modern Turkish.☆23Jul 8, 2020Updated 5 years ago
- ☆10Aug 31, 2023Updated 2 years ago
- ☆51Jul 25, 2024Updated last year
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- sougou医学词库爬取☆13Nov 21, 2019Updated 6 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Code and Data for Evaluation WG☆42May 4, 2022Updated 3 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆11Apr 2, 2024Updated last year
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- LLM Agent that performs sentiment analysis of drawings and natural language using a combination of Google Gemini Vision model and GPT-4 T…☆13Dec 22, 2023Updated 2 years ago
- ☆43Apr 20, 2023Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago