☆30Apr 8, 2024Updated last year
Alternatives and similar repositories for wav2lip_extension
Users that are interested in wav2lip_extension are comparing it to the libraries listed below
Sorting:
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- ☆19Oct 10, 2025Updated 5 months ago
- Play retro console games directly in SillyTavern chats using EmulatorJS☆31Jun 14, 2025Updated 8 months ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- ☆17Jul 22, 2024Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆53Updated this week
- Detect emotion from audio☆13Nov 20, 2018Updated 7 years ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- ☆18Sep 19, 2023Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Sep 17, 2024Updated last year
- Real-time end-to-end singing voice convertion☆24Nov 3, 2024Updated last year
- Scribe is a free and open-source desktop assistant for speech-to-text conversion. It allows you to control your computer with your voice…☆30Dec 13, 2025Updated 2 months ago
- ☆23Oct 17, 2024Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆21Dec 15, 2020Updated 5 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- 2400+节点可视化 Visualization | Collection of ComfyUI Custom Nodes☆25May 22, 2024Updated last year
- AudiosPlugin is a Godot iOS Audio Plugin that resolves the audio recording issue in iOS for Godot Engine.☆10Jun 16, 2025Updated 8 months ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆28Dec 31, 2019Updated 6 years ago
- converts cai json file to pygmalion format☆10Jan 31, 2023Updated 3 years ago
- 将任意人的音色转换为成千上万种不同音色☆32Jun 29, 2023Updated 2 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 7 years ago
- AI-powered image translator with visual editing - Transform images with intelligent OCR and translation☆15Dec 22, 2025Updated 2 months ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆38Jun 25, 2025Updated 8 months ago
- Smart Health Band | 智能健康手环 A comprehensive IoT health monitoring solution featuring real-time vital signs tracking, wireless data transm…☆21Jan 8, 2026Updated 2 months ago
- ☆17Mar 2, 2026Updated last week
- ☆12Apr 20, 2025Updated 10 months ago
- real time face swap and one-click video deepfake with only a single image☆12Sep 13, 2024Updated last year