facebookresearch / fairseq2
FAIR Sequence Modeling Toolkit 2
☆900Updated this week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
Sorting:
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆277Updated 3 months ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆530Updated last year
- NeMo text processing for ASR and TTS☆327Updated 3 weeks ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆762Updated last month
- Large Context Attention☆710Updated 3 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆514Updated this week
- Library for Textless Spoken Language Processing☆541Updated last year
- MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation☆383Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models.☆574Updated last week
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆593Updated 5 months ago
- A Neural Framework for MT Evaluation☆589Updated last month
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆496Updated last year
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆557Updated 11 months ago
- ☆359Updated 8 months ago
- ☆353Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,526Updated 10 months ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆113Updated 8 months ago
- ☆353Updated last year
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆431Updated 2 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆379Updated 3 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,248Updated 2 months ago
- Minimalistic large language model 3D-parallelism training☆1,870Updated this week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆871Updated 2 weeks ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆551Updated 4 months ago
- Unified automatic quality assessment for speech, music, and sound.☆484Updated 2 weeks ago
- ☆2,807Updated 2 weeks ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,133Updated 2 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆651Updated 7 months ago
- State-of-the-art LLM-based translation models.☆524Updated last month
- PyTorch Implementation of FastDiff (IJCAI'22)☆410Updated 10 months ago