facebookresearch / fairseq2Links
FAIR Sequence Modeling Toolkit 2
☆929Updated this week
Alternatives and similar repositories for fairseq2
Users that are interested in fairseq2 are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,159Updated 3 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆779Updated 2 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads☆2,549Updated 11 months ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,258Updated 3 months ago
- Minimalistic large language model 3D-parallelism training☆1,926Updated last week
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆280Updated 5 months ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆535Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,571Updated last year
- State-of-the-art LLM-based translation models.☆534Updated 2 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆569Updated last year
- Official implementation of Half-Quadratic Quantization (HQQ)☆832Updated this week
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆617Updated 7 months ago
- Code for BLT research paper☆1,686Updated last month
- NeMo text processing for ASR and TTS☆342Updated this week
- PyTorch extensions for high performance and large scale training.☆3,331Updated last month
- ☆2,834Updated 2 weeks ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,005Updated 10 months ago
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,343Updated 7 months ago
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆628Updated last year
- Helpful tools and examples for working with flex-attention☆831Updated last week
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆435Updated 2 years ago
- Fast inference engine for Transformer models☆3,856Updated 2 months ago
- Large Context Attention☆716Updated 4 months ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,491Updated this week
- Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration☆1,577Updated 5 months ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆561Updated last year
- Library for Textless Spoken Language Processing☆543Updated last year
- Automatically split your PyTorch models on multiple GPUs for training & inference☆655Updated last year
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆731Updated 8 months ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,761Updated 5 months ago