PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Apr 13, 2023Updated 2 years ago
Alternatives and similar repositories for Pits-Japanese-Onnx
Users that are interested in Pits-Japanese-Onnx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PITS-中日英韩☆12Mar 14, 2023Updated 3 years ago
- 基于vits fastspeech2 visinger的tts模型☆24Mar 9, 2023Updated 3 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 7 months ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- 多邻国后悔药 Duolingo Regret☆13Jan 31, 2025Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Jan 19, 2023Updated 3 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Turn Fake Pixel Art Real — Fast, free, and runs in your browser.☆50Mar 23, 2026Updated last week
- ☆13Dec 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- My implementation of diffusion (like) models☆11Apr 14, 2023Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆281Jul 16, 2023Updated 2 years ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- 多个SVC/TTS的C++推理库☆1,121May 18, 2025Updated 10 months ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- 一个快速制作语音数据集的可视化工具☆198Mar 7, 2024Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An Implementation of Singing Voice Conversion Based on Diffsinger☆74Feb 20, 2023Updated 3 years ago
- Extract TNM cancer staging from pathology notes.☆14Aug 2, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated 2 weeks ago
- Executable file for VITS inference☆10Jan 19, 2023Updated 3 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆61Oct 23, 2024Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- 2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆62Sep 1, 2024Updated last year
- Here the code of EmoAudioNet is a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for 'Alzheimer’s Disease Classification Using Cluster-based Labelling for Graph Neural Network on Tau PET Imaging and Heterogeneous …☆12Sep 13, 2022Updated 3 years ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- 迅雷、快车、旋风下载链接转换脚本。☆10Apr 22, 2020Updated 5 years ago
- HTTP sever for private api☆25Updated this week
- HCC_Deep_learning☆19Jun 8, 2020Updated 5 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆176Dec 18, 2023Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago