tincans-ai / gazelle
Joint speech-language model - respond directly to audio!
☆365Updated 6 months ago
Alternatives and similar repositories for gazelle:
Users that are interested in gazelle are comparing it to the libraries listed below
- ☆195Updated 7 months ago
- On-device intelligence.☆216Updated 4 months ago
- Video+code lecture on building nanoGPT from scratch☆65Updated 7 months ago
- A ggml (C++) re-implementation of tortoise-tts☆174Updated 4 months ago
- ☆266Updated 7 months ago
- Collection of Open Source Speech Data☆151Updated 2 months ago
- ☆254Updated 10 months ago
- ☆471Updated 7 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated 8 months ago
- Whisper with Medusa heads☆818Updated 2 weeks ago
- Implementation of F5-TTS in MLX☆429Updated last week
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆293Updated 7 months ago
- Open source conversation framework and visual editor for structured Pipecat dialogues☆94Updated this week
- Joint speech-language model - respond directly to audio!☆30Updated 8 months ago
- ☆154Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆114Updated this week
- ☆325Updated 4 months ago
- ☆250Updated this week
- FastMLX is a high performance production ready API to host MLX models.☆252Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆148Updated 6 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆615Updated 8 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆184Updated 2 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆177Updated 9 months ago
- Interface for OuteTTS models.☆859Updated this week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆468Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆136Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆81Updated 8 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆162Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆62Updated this week
- ☆255Updated 7 months ago