facebookresearch / fairseq2
FAIR Sequence Modeling Toolkit 2
☆770Updated this week
Alternatives and similar repositories for fairseq2:
Users that are interested in fairseq2 are comparing it to the libraries listed below
- NeMo text processing for ASR and TTS☆298Updated this week
- Code for the ALiBi method for transformer language models (ICLR 2022)☆512Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆265Updated last week
- Library for Textless Spoken Language Processing☆531Updated last year
- Large Context Attention☆677Updated last week
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆654Updated last month
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆499Updated 3 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆525Updated 7 months ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆361Updated 3 years ago
- Minimalistic large language model 3D-parallelism training☆1,407Updated this week
- Sequence modeling with Mega.☆297Updated 2 years ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,183Updated 3 months ago
- Helpful tools and examples for working with flex-attention☆603Updated this week
- State-of-the-art LLM-based translation models.☆477Updated this week
- Scalable toolkit for efficient model alignment☆697Updated this week
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆105Updated 4 months ago
- Speech, Language, Audio, Music Processing with Large Language Model☆694Updated last week
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆473Updated 2 months ago
- Automatically split your PyTorch models on multiple GPUs for training & inference☆647Updated last year
- Official implementation of Half-Quadratic Quantization (HQQ)☆737Updated 2 weeks ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆487Updated last year
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆435Updated last year
- Library for 8-bit optimizers and quantization routines.☆717Updated 2 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆520Updated last year
- Fast Inference Solutions for BLOOM☆563Updated 3 months ago
- Microsoft Automatic Mixed Precision Library☆554Updated 4 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆945Updated last year
- Contrastive Language-Audio Pretraining☆1,513Updated 2 months ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs…☆2,121Updated this week
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆633Updated 3 months ago