A web app that lets you play around with TalkNet models
☆124 · Jul 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for ControllableTalkNet
Users that are interested in ControllableTalkNet are comparing it to the libraries listed below
- NeMo: a toolkit for conversational AI ☆12 · Dec 23, 2022 · Updated 3 years ago
- NVIDIA's TalkNET - Train and Synthesize on colab ☆15 · Dec 6, 2025 · Updated 2 months ago
- Custom content tool for The Ponies. ☆12 · Feb 4, 2025 · Updated last year
- NVIDIA's TalkNET - Train on colab ☆37 · Mar 15, 2023 · Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 5 months ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 · Dec 24, 2022 · Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 · May 27, 2021 · Updated 4 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆33 · Mar 11, 2025 · Updated 11 months ago
- A Python/Pytorch app for easily synthesising human voices ☆1,445 · Dec 2, 2024 · Updated last year
- Vid Driven Portrait Animation 🤢😷 ☆18 · Jul 7, 2024 · Updated last year
- ☆17 · Nov 3, 2022 · Updated 3 years ago
- ☆22 · Jan 25, 2026 · Updated last month
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- A Neural Audio Codec (NAC) for Universal Audio ☆44 · May 30, 2025 · Updated 9 months ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental! ☆43 · Mar 24, 2023 · Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 3 years ago
- A unified, browser-based interface for pony voice generation ☆46 · Jan 16, 2025 · Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips ☆11 · Dec 4, 2022 · Updated 3 years ago
- ☆10 · Apr 8, 2024 · Updated last year
- Audio datasets, easier. ☆86 · Aug 19, 2023 · Updated 2 years ago
- Stable diffusion google colab kernel ☆10 · Aug 17, 2022 · Updated 3 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆16 · Apr 18, 2024 · Updated last year
- Next-generation, fully open-source refacer. Images. GIFs. TIFFs. Full-length videos. Bulk refacing ☆41 · May 16, 2025 · Updated 9 months ago
- GPT for FACodec ☆13 · Mar 25, 2024 · Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆37 · Dec 31, 2025 · Updated 2 months ago
- ☆33 · Jun 29, 2023 · Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · May 30, 2023 · Updated 2 years ago
- Code & demo for the animation of still facial landmarks from an initial pose. ☆15 · Jan 19, 2023 · Updated 3 years ago
- ☆16 · Apr 4, 2022 · Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆55 · Oct 15, 2021 · Updated 4 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions ☆94 · Nov 6, 2023 · Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆32 · Jun 15, 2023 · Updated 2 years ago
- The implementation of the paper *SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors* [CVPR 2025] ☆37 · Nov 11, 2025 · Updated 3 months ago
- Doing style transfer with linguistic features using OpenAI's CLIP. ☆14 · May 4, 2021 · Updated 4 years ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ☆58 · Sep 11, 2020 · Updated 5 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 · Sep 24, 2021 · Updated 4 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron. ☆57 · Dec 21, 2022 · Updated 3 years ago
- Collect Voice Conversion researches ☆96 · Updated this week
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project ☆34 · Jul 21, 2023 · Updated 2 years ago