facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,339 Updated 3 months ago
Alternatives and similar repositories for seamless_communication:
Users who are interested in seamless_communication are comparing it to the libraries listed below.
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆21,505 Updated last month
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆13,990 Updated this week
- Faster Whisper transcription with CTranslate2 ☆14,234 Updated last month
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,542 Updated 10 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,592 Updated this week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆8,508 Updated 2 weeks ago
- Inference code for CodeLlama models ☆16,208 Updated 6 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆11,798 Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆38,475 Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆30,988 Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆21,480 Updated 6 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆8,121 Updated 7 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,220 Updated 9 months ago
- the AI-native open-source embedding database ☆17,839 Updated this week
- Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with Inference, Fine-Tuning, RAG. We als… ☆16,243 Updated this week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. ☆3,741 Updated last month
- Universal LLM Deployment Engine with ML Compilation ☆20,008 Updated last week
- 🔊 Text-Prompted Generative Audio Model ☆36,988 Updated 6 months ago
- Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud. ☆10,620 Updated this week
- High-speed Large Language Model Serving for Local Deployment ☆8,106 Updated this week
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,337 Updated 6 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild ☆6,881 Updated 6 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆37,794 Updated 6 months ago
- Go ahead and axolotl questions ☆8,648 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆39,047 Updated this week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents ☆5,461 Updated 4 months ago
- A series of large language models trained from scratch by developers @01-ai ☆7,814 Updated 2 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,250 Updated 8 months ago
- A guidance language for controlling large language models. ☆19,671 Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆36,193 Updated this week