facebookresearch / fairseq2
FAIR Sequence Modeling Toolkit 2
☆1,112 Updated last week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling ☆1,260 Updated 10 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders. ☆858 Updated 3 months ago
- NeMo text processing for ASR and TTS ☆417 Updated last week
- Inworld TTS ☆629 Updated 4 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆294 Updated last week
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a… ☆644 Updated last year
- AcademiCodec: An Open Source Audio Codec Model for Academic Research ☆665 Updated 2 years ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,697 Updated last year
- Code for the ALiBi method for transformer language models (ICLR 2022) ☆549 Updated 2 years ago
- Audio Large Language Models ☆857 Updated 6 months ago
- Library for Textless Spoken Language Processing ☆555 Updated 2 years ago
- The official implementation of Self-Play Preference Optimization (SPPO) ☆583 Updated last year
- A Framework for Speech, Language, Audio, Music Processing with Large Language Model ☆958 Updated 2 weeks ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate ☆745 Updated last year
- SALMONN family: A suite of advanced multi-modal LLMs ☆1,383 Updated 4 months ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline ☆432 Updated 2 years ago
- State-of-the-art LLM-based translation models. ☆577 Updated 9 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch ☆670 Updated last year
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand". ☆466 Updated last year
- A Neural Framework for MT Evaluation ☆711 Updated 4 months ago
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch ☆654 Updated last year
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆284 Updated 10 months ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed) ☆395 Updated 4 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆569 Updated 2 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons ☆1,220 Updated 2 weeks ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation ☆121 Updated last year
- A fast and lightweight python-based CTC beam search decoder for speech recognition. ☆465 Updated 2 years ago
- ☆386 Updated last year
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆1,050 Updated last year
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023) ☆346 Updated last year