本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。
☆57Aug 30, 2024Updated last year
Alternatives and similar repositories for VITS-Pytorch
Users that are interested in VITS-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago
- Live2D + ASR + LLM + TTS → Real-time communication + Offline Deployment/Cloud Inference 实时沟通 本地部署/云端推理☆40Apr 21, 2025Updated last year
- Drive your metahuman to speak within 1 second.☆11Mar 21, 2025Updated last year
- vits2 backbone with multilingual-bert, modified for Cantonese support☆26Apr 16, 2025Updated last year
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆75Feb 28, 2024Updated 2 years ago
- A research project and comparative study on various Active Noise Cancellation Algorithms like FxLMS, EMFN, Chebyshev filter and Hammerste…☆10Jul 3, 2022Updated 3 years ago
- 记录学习geneface++所遇到的各种问题☆12Aug 5, 2024Updated last year
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆723Dec 17, 2025Updated 5 months ago
- ☆34Jul 29, 2025Updated 10 months ago
- 基于ultralytics训练的行人跌倒检测模型☆20Jul 10, 2023Updated 2 years ago
- 记录关于AEC的论文和代码、博客以及相关资料☆15Jul 26, 2022Updated 3 years ago
- ☆13Jun 24, 2021Updated 4 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆40Jan 4, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Dataset simulation for DPCCN.☆16Dec 25, 2022Updated 3 years ago
- ☆49Oct 24, 2023Updated 2 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- 谷歌浏览器插件学习与开发☆12Mar 11, 2019Updated 7 years ago
- 仿照Photoshop的在线图像处理软件。演示地址:☆12Jul 8, 2018Updated 7 years ago
- This repository contains an unofficial pytorch implementation of BSRNN for music separation, attempting to reproduce the results of the o…☆12May 7, 2026Updated last month
- Small compression utility☆38Jan 20, 2026Updated 4 months ago
- ☆18Apr 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 支持Typecho1.1的赞赏功能代码☆15Aug 25, 2018Updated 7 years ago
- an method to make vlm think like r1☆21May 28, 2025Updated last year
- Pantone color libraries as .acb files for Photoshop etc☆22Jun 6, 2024Updated 2 years ago
- VITS2 for Chinese speech | 最新VITS2中文语音合成☆134Oct 26, 2023Updated 2 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- ☆42Jan 4, 2024Updated 2 years ago
- ☆14Oct 19, 2024Updated last year
- ☆21Apr 27, 2024Updated 2 years ago
- 语音合成从零开始☆11Nov 28, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 基于Paddle进行语义检索并部署上线,支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support☆13Aug 11, 2022Updated 3 years ago
- This is a Real-time howling detection and suppression algorithm using Matlab simulink.☆46Mar 21, 2023Updated 3 years ago
- A real-time voice conversation system based on WebSocket and LLM, integrating Automatic Speech Recognition (ASR), Large Language Model co…☆20Feb 11, 2025Updated last year
- 使用ONNXRuntime部署一种用于边缘检测的轻量级密集卷积神经网络LDC,包含C++和Python两个版本的程序☆11Apr 24, 2023Updated 3 years ago
- Turn any Windows precision touchpad into a touchscreen.☆12Oct 21, 2018Updated 7 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆160Aug 9, 2025Updated 10 months ago