Mozer / wav2lip_extensionView external linksLinks
☆30Apr 8, 2024Updated last year
Alternatives and similar repositories for wav2lip_extension
Users that are interested in wav2lip_extension are comparing it to the libraries listed below
Sorting:
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- ☆19Oct 10, 2025Updated 4 months ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆52Updated this week
- Detect emotion from audio☆13Nov 20, 2018Updated 7 years ago
- Scribe is a free and open-source desktop assistant for speech-to-text conversion. It allows you to control your computer with your voice…☆29Dec 13, 2025Updated 2 months ago
- ☆18Sep 19, 2023Updated 2 years ago
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Sep 17, 2024Updated last year
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- Persian Grapheme-to-Phoneme (G2P) converter☆21Dec 15, 2020Updated 5 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- 2400+节点可视化 Visualization | Collection of ComfyUI Custom Nodes☆25May 22, 2024Updated last year
- ☆25Mar 6, 2024Updated last year
- AudiosPlugin is a Godot iOS Audio Plugin that resolves the audio recording issue in iOS for Godot Engine.☆10Jun 16, 2025Updated 8 months ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆36Feb 11, 2025Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆39Sep 18, 2024Updated last year
- ☆28Dec 31, 2019Updated 6 years ago
- converts cai json file to pygmalion format☆10Jan 31, 2023Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- 将任意人的音色转换为成千上万种不同音色☆32Jun 29, 2023Updated 2 years ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- AI-powered image translator with visual editing - Transform images with intelligent OCR and translation☆14Dec 22, 2025Updated last month
- ☆17Jan 28, 2026Updated 2 weeks ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆38Jun 25, 2025Updated 7 months ago
- real time face swap and one-click video deepfake with only a single image☆11Sep 13, 2024Updated last year
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 6 years ago
- ☆35Mar 14, 2023Updated 2 years ago
- ☆75Oct 19, 2024Updated last year
- A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C#☆11Oct 6, 2022Updated 3 years ago