本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。
☆55Aug 30, 2024Updated last year
Alternatives and similar repositories for VITS-Pytorch
Users that are interested in VITS-Pytorch are comparing it to the libraries listed below
Sorting:
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago
- vits2 backbone with multilingual-bert, modified for Cantonese support☆25Apr 16, 2025Updated 10 months ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆76Feb 28, 2024Updated 2 years ago
- 语音合成从零开始☆11Nov 28, 2023Updated 2 years ago
- 活体检测:眨眼检测、张嘴检测、摇头检测、点头检测☆35Jun 7, 2021Updated 4 years ago
- CCC2018: Observer-Based Tracking Control for Suppressing Stick-Slip Vibration of Drillstring System☆10Aug 22, 2021Updated 4 years ago
- 使用ONNXRuntime部署一种用于边缘检测的轻量级密集卷积神经网络LDC,包含C++和Python两个版本的程序☆11Apr 24, 2023Updated 2 years ago
- VST/LV2/VST3 plugins of the KOMPASSI-Renderer☆14Jun 10, 2024Updated last year
- This is a shader can running on Minecraft Java Edition For Phone project which uses GL4ES. This repository contains source code for iOS/i…☆14Aug 13, 2023Updated 2 years ago
- Active noise controller (ANC) design: a practical primer☆13Jan 8, 2026Updated 2 months ago
- AD-HRNet:用于遥感图像语义分割的结合注意力机制和膨胀卷积的HRNet☆11Aug 13, 2023Updated 2 years ago
- Get input data from Joysticks such as the Xbox360 Controller into MATLAB. Also set vibration of the Joystick as well.☆12Dec 24, 2014Updated 11 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- ☆48Feb 14, 2025Updated last year
- SDKs and templates which aid in R.E.P.O. Modding☆15Jan 18, 2026Updated last month
- A discord bot with multiple features like music, reverse image search and more!☆10Updated this week
- Improving PointNet through the use of Self-Attention Layers to combine overall with fine-grained features.☆13Sep 22, 2023Updated 2 years ago
- Speech Separation☆10Jan 6, 2022Updated 4 years ago
- ☆28Jan 5, 2026Updated 2 months ago
- Processing for Hearing-Assistive/Augmented-reality Devices (HADES)☆13Jan 13, 2026Updated last month
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last month
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- PITS-中日英韩☆12Mar 14, 2023Updated 2 years ago
- 小模型LLM的搭建,学习LLM的建模、训练过程 基于DeepSeek-MOE架构的小模型,用于个人学习,从0开始,解释每一条语句☆14Mar 28, 2025Updated 11 months ago
- Spatial active noise control based on kernel interpolation of sound field☆13Mar 30, 2023Updated 2 years ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Apr 20, 2025Updated 10 months ago
- SSR Remote for Android☆14Nov 9, 2012Updated 13 years ago
- mouse pet-ct image segmentation☆12Feb 19, 2023Updated 3 years ago
- 使用onnxruntime部署C2PNet图像去雾,包含C++和Python两个版本的程序☆11Apr 11, 2024Updated last year
- A wrapper library of DTLN noise reduction☆17Aug 18, 2022Updated 3 years ago
- ☆11Jun 6, 2022Updated 3 years ago
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- A collection of minimal examples for the sparta plug-ins.☆13Jul 12, 2025Updated 7 months ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆13Feb 12, 2024Updated 2 years ago
- Uses a GAN to enhance images of Fingerprints☆12Jan 27, 2026Updated last month
- Sparse Multilabel Categorical Crossentropy☆11Sep 10, 2023Updated 2 years ago
- Multiple Constrained Minimum Variance (MCMV) beamformer☆13Apr 30, 2020Updated 5 years ago
- Unreal plugin with a CameraActor that captures RGB-D data and publishes it via TCP☆13Nov 1, 2024Updated last year