Joint speech-language model - respond directly to audio!
☆373 · Jul 1, 2024 · Updated last year
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below.
- Joint speech-language model - respond directly to audio! ☆30 · May 13, 2024 · Updated last year
- proof of concept conversation orchestrator with a speech-language model ☆20 · Oct 19, 2024 · Updated last year
- ☆16 · Oct 6, 2024 · Updated last year
- ☆19 · Mar 22, 2024 · Updated 2 years ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆61 · Oct 23, 2024 · Updated last year
- ☆39 · Apr 15, 2024 · Updated last year
- ☆55 · Jul 16, 2025 · Updated 8 months ago
- A fast multimodal LLM for real-time voice ☆4,379 · Dec 12, 2025 · Updated 3 months ago
- ☆29 · Feb 4, 2025 · Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 · Dec 12, 2024 · Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- ☆61 · Nov 4, 2023 · Updated 2 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. ☆32 · Jan 26, 2024 · Updated 2 years ago
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate ☆759 · Nov 19, 2024 · Updated last year
- Real-time Speech-Text Foundation Model Toolkit (wip) ☆254 · Mar 26, 2025 · Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆16 · Oct 14, 2025 · Updated 5 months ago
- ☆260 · Mar 15, 2024 · Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale ☆28 · Aug 4, 2023 · Updated 2 years ago
- Project of Singing Voice Conversion. ☆16 · Oct 27, 2023 · Updated 2 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models