spicytigermeat / neuTalk
Open Source Text-to-Speech GUI Tool running on TalkNet
☆11 · Dec 24, 2022 · Updated 3 years ago
Alternatives and similar repositories for neuTalk
Users that are interested in neuTalk are comparing it to the libraries listed below
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 4 months ago
- ☆14 · Mar 11, 2022 · Updated 3 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's ☆14 · Jun 24, 2023 · Updated 2 years ago
- A cross-platform audio recorder designed for recording using recording lists (a.k.a. reclists). ☆13 · Feb 7, 2026 · Updated last week
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- Home of the Chunkmogrify project ☆16 · Jan 11, 2022 · Updated 4 years ago
- Vid Driven Portrait Animation 🤢😷 ☆18 · Jul 7, 2024 · Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 · May 27, 2021 · Updated 4 years ago
- Python scripts I made to make NNSVS labeling easier. ☆27 · Jun 20, 2023 · Updated 2 years ago
- ☆24 · Sep 27, 2022 · Updated 3 years ago
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- Next-generation, fully open-source refacer. Images. GIFs. TIFFs. Full-length videos. Bulk refacing ☆41 · May 16, 2025 · Updated 8 months ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) ☆26 · Oct 9, 2021 · Updated 4 years ago
- Generate images from an initial frame and text ☆37 · Jul 28, 2023 · Updated 2 years ago
- Line by line audio recording tool for vocal libraries ☆29 · Feb 23, 2024 · Updated last year
- The Original Support for English NNSVS Dataset Creation ☆29 · Nov 14, 2024 · Updated last year
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin… ☆37 · Aug 18, 2024 · Updated last year
- ☆14 · Dec 5, 2025 · Updated 2 months ago
- TU Darmstadt - Deep Learning: Architectures & Methods Project SS21 ☆37 · Dec 17, 2024 · Updated last year
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS ☆40 · Aug 4, 2023 · Updated 2 years ago
- Resampler for UTSU that works on Mac, Windows, and Linux ☆39 · Jun 8, 2024 · Updated last year
- ☆40 · Jun 5, 2023 · Updated 2 years ago
- Learn Japanese using music. Frontend written in Nuxt.js and optional backend using Litserve ☆19 · Jun 2, 2025 · Updated 8 months ago
- This is a HeadSwap project not only face ☆34 · Dec 28, 2022 · Updated 3 years ago
- A simple UTAU voicebank recorder app for android. ☆12 · Feb 3, 2025 · Updated last year
- A simple lightweight library for text normalization for Indian Languages ☆16 · Sep 30, 2025 · Updated 4 months ago
- ☆10 · Jan 10, 2024 · Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆44 · Jul 24, 2023 · Updated 2 years ago
- A GUI for the Neutrino neural singing synthesizer ☆51 · Jan 14, 2021 · Updated 5 years ago
- NVIDIA's TalkNET - Train on colab ☆37 · Mar 15, 2023 · Updated 2 years ago
- ☆10 · Apr 18, 2025 · Updated 9 months ago
- Reorganizes Booru Datasets from Gwern to be valid for DeepDanbooru ☆12 · Aug 5, 2021 · Updated 4 years ago
- ☆13 · Nov 22, 2022 · Updated 3 years ago
- ☆14 · Sep 14, 2024 · Updated last year
- ☆12 · Mar 6, 2023 · Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 3 years ago
- ☆10 · Mar 21, 2025 · Updated 10 months ago
- tf-openpose and unity IK ☆10 · Jul 2, 2020 · Updated 5 years ago
- singing voice conversion based on glow-tts ☆12 · Aug 20, 2023 · Updated 2 years ago