facebookresearch / fairseq2
FAIR Sequence Modeling Toolkit 2
☆1,041Updated this week
Alternatives and similar repositories for fairseq2
Users who are interested in fairseq2 are comparing it to the libraries listed below.
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,209Updated 7 months ago
- Inworld TTS☆499Updated 2 weeks ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆825Updated 2 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,635Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆281Updated 8 months ago
- NeMo text processing for ASR and TTS☆376Updated last week
- Code for the ALiBi method for transformer language models (ICLR 2022)☆543Updated last year
- A Neural Framework for MT Evaluation☆666Updated last month
- State-of-the-art LLM-based translation models.☆557Updated 5 months ago
- The official implementation of Self-Play Preference Optimization (SPPO)☆583Updated 8 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆603Updated last year
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆270Updated 6 months ago
- Library for Textless Spoken Language Processing☆550Updated 2 years ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆686Updated 10 months ago
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch☆652Updated 9 months ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.☆1,817Updated 8 months ago
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆650Updated last year
- Speech, Language, Audio, Music Processing with Large Language Model☆898Updated last month
- Facebook Low Resource (FLoRes) MT Benchmark☆755Updated last year
- The repository of Uni-MoE model series☆768Updated 2 weeks ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,349Updated last year
- SALMONN family: A suite of advanced multi-modal LLMs☆1,329Updated last week
- ☆875Updated this week
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,178Updated last month
- [ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"☆296Updated 8 months ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆457Updated last year
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,281Updated 7 months ago
- The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.☆262Updated 4 months ago
- PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model☆655Updated last year
- Sequence modeling with Mega.☆300Updated 2 years ago