A refactor of the GPT-SoVITS project: parts of the code were rewritten, the web UI was streamlined, and API access was added.
☆29Dec 17, 2024Updated last year
Alternatives and similar repositories for GPT-SoVits
Users that are interested in GPT-SoVits are comparing it to the libraries listed below.
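- A batch inference tool that runs inference on the same text multiple times, with support for randomized parameters, until the most satisfactory result is selected.☆11Aug 19, 2024Updated last year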
- ☆11Feb 20, 2025Updated last year
- singing voice conversion without f0☆23May 10, 2023Updated 3 years ago
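- The Bert-VITS2 project has many bugs and its tutorials are unfriendly. This project fixes as many Bert-VITS2 bugs as possible and supports one-click training. With only 50 utterances from the target speaker, you get a stable, fast TTS model.☆69Aug 19, 2025Updated 9 months ago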
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- ☆41Jul 15, 2025Updated 10 months ago
- EaseVoice Trainer is a simple and user-friendly voice cloning and speech model trainer.☆14Apr 27, 2025Updated last year
- All generative models in one for a better TTS model☆74Sep 8, 2024Updated last year
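- A very easy-to-use tnn classify demo☆11Mar 24, 2021Updated 5 years ago
- ONNX inference version of Bert-VITS2☆44Apr 24, 2024Updated 2 years ago
- A tool that collects the laughter segments from large amounts of audio data☆12May 23, 2024Updated 2 years ago
- This project handles data preprocessing. The first step processes the collected audio, using Funasr timestamps to remove empty background audio. It also includes the labels prepared before feeding data to BERT☆15May 27, 2025Updated last year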
- ☆13Jun 8, 2024Updated 2 years ago
- Batch auditioning of inference results for GPT-SoVITS reference audio☆52Mar 8, 2024Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
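- A small project that wraps Doubao ASR as an OpenAI-compatible API, supports launching via Docker, and provides a reference correction prompt for use with Spokenly, achieving speech correction similar to Typeless.☆40Feb 28, 2026Updated 3 months ago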
- Dialog/Vocal Processor VST☆12May 26, 2025Updated last year
- ☆14Jan 2, 2025Updated last year
- ☆34Aug 7, 2025Updated 10 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- An audio effects plugin that simulates moving surround sound audio over headphones.☆12Nov 11, 2020Updated 5 years ago
- Portrait segmentation SDK (supports images and video) for Windows, Android, and iOS. human segmentation matting☆25Aug 6, 2024Updated last year
- flow mirror models from JZX AI Labs☆43Sep 30, 2024Updated last year
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- ☆49Aug 16, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- VST plugin containing the 3D Tune-In Toolkit☆11Mar 31, 2022Updated 4 years ago
- This repository contains the dataset used to train the neural network model described in the paper "Implicit HRTF Modeling Using Tempora…☆11Aug 4, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- ☆15Jan 4, 2022Updated 4 years ago
- My implementation of Epoch-Synchronous Overlap-Add method for time stretching and pitch shifting.☆10Jan 25, 2020Updated 6 years ago
- A real-time audio processing application for standalone and mobile devices☆12Sep 10, 2018Updated 7 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆298May 22, 2024Updated 2 years ago
- vits2 backbone with multilingual-bert, modified for Cantonese support☆26Apr 16, 2025Updated last year
- LTFT-Phase-Vocoder is an audio effect that slows down an audio signal without dilating its frequency content or pitch.☆16Dec 19, 2020Updated 5 years ago
- A simple synthesizer where the oscillator is determined by a user-defined path.☆18Nov 24, 2019Updated 6 years ago
- A quick dubbing tool for video editing, based on GPT-SoVITS☆175Mar 15, 2024Updated 2 years ago
- A real-time voice conversion model based on VITS.☆17Aug 1, 2024Updated last year