facebookresearch / fairseq2Links
FAIR Sequence Modeling Toolkit 2
☆1,038Updated last week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,192Updated 6 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆822Updated last month
- NeMo text processing for ASR and TTS☆373Updated last week
- Inworld TTS☆486Updated this week
- Code for the ALiBi method for transformer language models (ICLR 2022)☆542Updated last year
- The official implementation of Self-Play Preference Optimization (SPPO)☆580Updated 7 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆599Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,618Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆282Updated 7 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆267Updated 6 months ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆664Updated 9 months ago
- Library for Textless Spoken Language Processing☆550Updated 2 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,277Updated 6 months ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆118Updated last year
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆644Updated last year
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch☆651Updated 8 months ago
- APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention☆255Updated 4 months ago
- State-of-the-art LLM-based translation models.☆553Updated 5 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆537Updated 4 months ago
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks☆3,060Updated this week
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆759Updated last month
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration☆1,582Updated 8 months ago
- Minimalistic large language model 3D-parallelism training☆2,191Updated 2 weeks ago
- MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation☆388Updated 2 years ago
- Audio Large Language Models☆715Updated 2 months ago
- A repository for research on medium sized language models.☆510Updated 3 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆661Updated 11 months ago
- ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “E…☆268Updated 2 years ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆969Updated last year
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆453Updated last year