facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,156Updated 2 months ago
Alternatives and similar repositories for seamless_communication:
Users that are interested in seamless_communication are comparing it to the libraries listed below
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆21,325Updated this week
- Universal LLM Deployment Engine with ML Compilation☆19,630Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,496Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,751Updated 11 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆13,023Updated 3 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,312Updated 5 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆21,096Updated 5 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆15,910Updated this week
- 🔊 Text-Prompted Generative Audio Model☆36,678Updated 4 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,168Updated 7 months ago
- Large Language Model Text Generation Inference☆9,592Updated this week
- Inference code for CodeLlama models☆16,150Updated 5 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,688Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,197Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆13,382Updated this week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,499Updated 9 months ago
- AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head☆10,071Updated 6 months ago
- ImageBind One Embedding Space to Bind Them All☆8,476Updated 5 months ago
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory☆20,611Updated this week
- Community interface for generative AI☆8,896Updated 8 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,699Updated last week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆16,235Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,113Updated 8 months ago
- ☆7,720Updated 9 months ago
- ☆7,938Updated 7 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆41,616Updated this week
- Tensor library for machine learning☆11,541Updated this week
- StableLM: Stability AI Language Models☆15,831Updated 9 months ago
- the AI-native open-source embedding database☆17,023Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆16,978Updated this week