☆175Dec 1, 2023Updated 2 years ago
Alternatives and similar repositories for AI-voice-chat
Users that are interested in AI-voice-chat are comparing it to the libraries listed below
Sorting:
- ☆363Jun 26, 2024Updated last year
- ☆17Sep 22, 2024Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- CLARA: Code Language Assistant & Repository Analyzer☆95Jul 4, 2023Updated 2 years ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated last year
- ☆25Sep 5, 2025Updated 6 months ago
- https://hf.co/hexgrad/Kokoro-82M☆14Jan 14, 2026Updated 2 months ago
- ☆19May 2, 2024Updated last year
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆35Feb 28, 2026Updated 3 weeks ago
- ControlNet with Txt2Img | Img2Img | + Multiple LoRAs, All in one jupyter notebook for Flux.1 dev. Able to run on Google Colab Free Tier☆22Dec 1, 2024Updated last year
- OpenAI Whisper + davinci for podcast summarization☆69May 16, 2023Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆35May 10, 2025Updated 10 months ago
- [EG 2023] Sketch Video Synthesis☆221Jul 17, 2024Updated last year
- ☆135Nov 24, 2023Updated 2 years ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆1,261Mar 9, 2026Updated last week
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Oct 23, 2024Updated last year
- CPU inference version of VisemeNet-tensorflow☆14Nov 6, 2019Updated 6 years ago
- ☆259Mar 15, 2024Updated 2 years ago
- A collection of example programs and clients written for the Solana blockchain☆13Jun 10, 2024Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- ☆1,153Feb 13, 2025Updated last year
- ☆27Apr 9, 2023Updated 2 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- Real time streaming talking head☆482May 17, 2024Updated last year
- ☆391Sep 3, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Open-Sora: Democratizing Efficient Video Production for All☆19Nov 7, 2024Updated last year
- ☆23Oct 19, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- The official implementation of HierSpeech++☆1,242Feb 20, 2024Updated 2 years ago
- The implementation for "DEER: Descriptive Knowledge Graph for Explaining Entity Relationships" (EMNLP '22)☆12Oct 31, 2022Updated 3 years ago
- ☆498May 27, 2024Updated last year
- ☆14Oct 16, 2023Updated 2 years ago
- I publish my weekly research here☆20Jun 26, 2025Updated 8 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloning☆146Jan 25, 2024Updated 2 years ago