facebookresearch / fairseq2Links
FAIR Sequence Modeling Toolkit 2
☆1,012Updated this week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,165Updated 4 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆789Updated 3 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆280Updated 5 months ago
- NeMo text processing for ASR and TTS☆347Updated this week
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆626Updated 8 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆581Updated last year
- The official implementation of Self-Play Preference Optimization (SPPO)☆569Updated 5 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆264Updated 4 months ago
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆630Updated last year
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,575Updated last year
- State-of-the-art LLM-based translation models.☆542Updated 3 months ago
- Library for Textless Spoken Language Processing☆545Updated last year
- Code for the ALiBi method for transformer language models (ICLR 2022)☆536Updated last year
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆745Updated 2 months ago
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration☆1,575Updated 6 months ago
- ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “E…☆268Updated 2 years ago
- Audio Large Language Models☆611Updated 2 weeks ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,773Updated 6 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,338Updated last year
- Awesome speech/audio LLMs, representation learning, and codec models☆1,071Updated this week
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"☆197Updated 11 months ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆116Updated 10 months ago
- Speech, Language, Audio, Music Processing with Large Language Model☆853Updated last month
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks☆2,758Updated this week
- The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.☆234Updated 2 months ago
- Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs☆827Updated 2 months ago
- ☆370Updated 10 months ago
- A recipe for online RLHF and online iterative DPO.☆522Updated 6 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆894Updated 2 months ago
- APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention☆242Updated 2 months ago