PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Apr 13, 2023Updated 3 years ago
Alternatives and similar repositories for Pits-Japanese-Onnx
Users that are interested in Pits-Japanese-Onnx are comparing it to the libraries listed below.
- PITS for Chinese, Japanese, English, and Korean☆12Mar 14, 2023Updated 3 years ago
- TTS model based on VITS, FastSpeech2, and VISinger☆24Mar 9, 2023Updated 3 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- An x264 encoding script from a certain SAP☆13Dec 30, 2015Updated 10 years ago
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 8 months ago
- 4 GB GPU & 10 minutes for training☆12Aug 9, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- Fast screen refresh controller for the Nook Simple Touch☆40Jul 27, 2012Updated 13 years ago
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Jan 19, 2023Updated 3 years ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- My implementation of diffusion (like) models☆11Apr 14, 2023Updated 3 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆281Jul 16, 2023Updated 2 years ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- ☆14Dec 28, 2024Updated last year
- C++ inference library for multiple SVC/TTS models☆1,120May 18, 2025Updated 11 months ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆329Sep 24, 2022Updated 3 years ago
- A Tampermonkey userscript that remembers Bilibili video album playback progress locally in the browser☆10Nov 25, 2020Updated 5 years ago
- ☆21May 30, 2024Updated last year
- A visual tool for quickly creating speech datasets☆199Mar 7, 2024Updated 2 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- An Implementation of Singing Voice Conversion Based on Diffsinger☆74Feb 20, 2023Updated 3 years ago
- Extract TNM cancer staging from pathology notes.☆14Aug 2, 2024Updated last year
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated last month
- Executable file for VITS inference☆10Jan 19, 2023Updated 3 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆63Sep 1, 2024Updated last year
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 8 months ago
- Collected information about the (discontinued) DxO-One Camera. No warranties for anything.☆28Dec 7, 2024Updated last year
- Code for 'Alzheimer’s Disease Classification Using Cluster-based Labelling for Graph Neural Network on Tau PET Imaging and Heterogeneous …☆12Sep 13, 2022Updated 3 years ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- Share files with Minus from your Windows Explorer context menu.☆35Apr 1, 2012Updated 14 years ago
- Script for converting Xunlei, FlashGet, and QQ Xuanfeng download links.☆10Apr 22, 2020Updated 5 years ago