PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Apr 13, 2023Updated 3 years ago
Alternatives and similar repositories for Pits-Japanese-Onnx
Users that are interested in Pits-Japanese-Onnx are comparing it to the libraries listed below.
- PITS for Chinese, Japanese, English, and Korean☆12Mar 14, 2023Updated 3 years ago
- TTS model based on VITS, FastSpeech2, and VISinger☆24Mar 9, 2023Updated 3 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- x264 encoding script from a certain SAP☆13Dec 30, 2015Updated 10 years ago
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 9 months ago
- 4 GB GPU & 10 minutes for training☆12Aug 9, 2023Updated 2 years ago
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Duolingo "regret pill" (Duolingo Regret)☆13Jan 31, 2025Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- Fast screen refresh controller for the Nook Simple Touch☆40Jul 27, 2012Updated 13 years ago
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Jan 19, 2023Updated 3 years ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- My implementation of diffusion (like) models☆11Apr 14, 2023Updated 3 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆280Jul 16, 2023Updated 2 years ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 4 years ago
- Collaborate to Adapt: Source-Free Graph Domain Adaptation via Bi-directional Adaptation (WWW-2024)☆11Jul 18, 2024Updated last year
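- A mascot ("kanban musume") desktop widget for PC built with Electron; the Live2D part uses the source code from https://github.com/fghrsh/live2d_demo with slight modifications (it uses local resources and no longer calls the original author's API)☆10Dec 9, 2022Updated 3 years ago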
- ☆14Dec 28, 2024Updated last year
- C++ inference library for multiple SVC/TTS models☆1,122May 18, 2025Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- Tampermonkey userscript that remembers the playback progress of Bilibili video collections locally in the browser☆10Nov 25, 2020Updated 5 years ago
- ☆21May 30, 2024Updated last year
- ☆61Nov 4, 2023Updated 2 years ago
- A visual tool for quickly building speech datasets☆198Mar 7, 2024Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 3 years ago
- Extract TNM cancer staging from pathology notes.☆14Aug 2, 2024Updated last year
- An Implementation of Singing Voice Conversion Based on Diffsinger☆73Feb 20, 2023Updated 3 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Mar 10, 2026Updated 2 months ago
- Executable file for VITS inference☆10Jan 19, 2023Updated 3 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆65Sep 1, 2024Updated last year
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- A lightweight audio codec based on a single quantizer☆70Aug 15, 2025Updated 9 months ago
- Collected information about the (discontinued) DxO-One Camera. No warranties for anything.☆29Dec 7, 2024Updated last year
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago