Long-Inference, High Quality Synthetic Speaker (AI avatar / AI presenter)
☆262 · Jul 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for LIHQ
Users who are interested in LIHQ are comparing it to the libraries listed below.
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆25 · Jul 5, 2022 · Updated 3 years ago
- ☆26 · Sep 22, 2022 · Updated 3 years ago
- ☆526 · Dec 26, 2023 · Updated 2 years ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti… ☆18 · Sep 22, 2023 · Updated 2 years ago
- Yin pitch estimator in PyTorch ☆117 · Nov 7, 2022 · Updated 3 years ago
- Vid Driven Portrait Animation 🤢😷 ☆18 · Jul 7, 2024 · Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆135 · Oct 23, 2024 · Updated last year
- [WIP] VoiceSmith makes training text to speech models easy. ☆229 · Oct 10, 2022 · Updated 3 years ago
- Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation ☆997 · Dec 4, 2023 · Updated 2 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 · Dec 24, 2022 · Updated 3 years ago
- Adaptive Vocoder for Custom Voice ☆61 · Sep 22, 2022 · Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 · Sep 24, 2021 · Updated 4 years ago
- ☆24 · Sep 27, 2022 · Updated 3 years ago
- Finally, some decent sample sentences ☆23 · Dec 3, 2023 · Updated 2 years ago
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513 ☆64 · Feb 13, 2023 · Updated 3 years ago
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 · Jul 14, 2022 · Updated 3 years ago
- RepVgg + HiFiGAN ☆36 · Aug 10, 2022 · Updated 3 years ago
- [ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation ☆657 · Mar 26, 2023 · Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 · Apr 29, 2022 · Updated 3 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice" ☆98 · Jun 7, 2022 · Updated 3 years ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730 ☆131 · Dec 8, 2023 · Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆146 · Aug 22, 2022 · Updated 3 years ago
- Collect Voice Conversion researches ☆96 · Updated this week
- Synthesized singing voice demos of WeSinger 2 paper. ☆26 · Feb 20, 2023 · Updated 3 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 · May 20, 2025 · Updated 9 months ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea… ☆15 · Jun 12, 2023 · Updated 2 years ago
- Official implementation of SawSing (ISMIR'22) ☆272 · Aug 28, 2022 · Updated 3 years ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation ☆546 · May 21, 2023 · Updated 2 years ago
- A high resolution and faster face editing framework (TPAMI) ☆140 · Sep 9, 2023 · Updated 2 years ago
- Updated fork of wav2lip-hq allowing for the use of current ESRGAN models ☆54 · May 6, 2024 · Updated last year
- Code for Few-Shot Head Swapping in the Wild (CVPR 2022 Oral) ☆258 · Apr 28, 2022 · Updated 3 years ago
- Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021) ☆1,282 · Jun 19, 2023 · Updated 2 years ago
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23) ☆379 · Jan 12, 2025 · Updated last year
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization. ☆18 · May 15, 2025 · Updated 9 months ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr… ☆17 · Apr 27, 2023 · Updated 2 years ago
- ncnn HiFi-GAN ☆29 · Sep 29, 2024 · Updated last year
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code ☆2,659 · Oct 18, 2024 · Updated last year
- A pre-trained face parser based on SegNeXt ☆50 · May 16, 2023 · Updated 2 years ago