facebookresearch / fairseq2Links
FAIR Sequence Modeling Toolkit 2
☆1,114Updated last week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,264Updated 11 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆861Updated 3 months ago
- Inworld TTS☆647Updated 4 months ago
- NeMo text processing for ASR and TTS☆424Updated this week
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆644Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,699Updated last year
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆666Updated 2 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆467Updated 2 years ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆745Updated last year
- The official implementation of Self-Play Preference Optimization (SPPO)☆582Updated last year
- Library for Textless Spoken Language Processing☆555Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆295Updated this week
- Code for the ALiBi method for transformer language models (ICLR 2022)☆549Updated 2 years ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆286Updated 10 months ago
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration☆1,595Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆944Updated 2 months ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆396Updated 4 years ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,866Updated last year
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆562Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,361Updated last year
- A Framework for Speech, Language, Audio, Music Processing with Large Language Model☆966Updated 3 weeks ago
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch☆654Updated last year
- State-of-the-art LLM-based translation models.☆577Updated 9 months ago
- APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention☆270Updated 2 months ago
- SALMONN family: A suite of advanced multi-modal LLMs☆1,387Updated 4 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆1,056Updated last year
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆122Updated last year
- maximal update parametrization (µP)☆1,673Updated last year
- A Neural Framework for MT Evaluation☆712Updated this week
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆520Updated 2 years ago