Convenient for developers to call inference models from version v1 to v3 through API, supporting streaming transmission and specified type file transfer.
☆43Mar 4, 2025Updated last year
Alternatives and similar repositories for GPT-SoVITS-V3-Infer-API
Users that are interested in GPT-SoVITS-V3-Infer-API are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GPT-SoVITS api for v3 version☆14Mar 5, 2025Updated last year
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆16Sep 29, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- 使用命令行界面(CLI)或 Python 包进行简单易用的人声分离,采用各种出色的模型(主要由 @Anjok07 作为 UVR 项目的一部分训练)☆25Mar 1, 2026Updated 2 months ago
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows☆267Jan 8, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chinese character recognition☆10Oct 27, 2020Updated 5 years ago
- 多进程 GFPGAN,提高运行效率和资源利用 ,根据设备不同提高数倍速度☆13Oct 31, 2023Updated 2 years ago
- GitHub repository linked to AnimeBackgroundGAN HuggingFace Space☆10May 24, 2022Updated 3 years ago
- ☆13Oct 11, 2024Updated last year
- ☆17May 14, 2024Updated last year
- ☆17Jun 3, 2020Updated 5 years ago
- Text to Speech for Japanese☆15May 11, 2023Updated 2 years ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- GUI for a Vocal Remover that uses Deep Neural Networks.☆17Jan 18, 2024Updated 2 years ago
- Table Structure Recognition☆28Jul 25, 2024Updated last year
- ☆11May 5, 2021Updated 5 years ago
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- 日本語TTS(VITS)の学習と音声合成のGradio WebUI☆42Jan 5, 2024Updated 2 years ago
- ☆53Apr 2, 2026Updated last month
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 9 years ago
- Declarative C++ build tool☆11Mar 29, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- text to speech using autoregressive transformer and VITS☆248Apr 3, 2024Updated 2 years ago
- ☆60Oct 22, 2025Updated 6 months ago
- ☆11Feb 20, 2025Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- CNTK implementation of Fully Convolutional Networks (FCN) with ResNet for semantic segmentation☆12Aug 18, 2017Updated 8 years ago
- YuE with mp3 extend, exllama and GUI☆65Feb 24, 2025Updated last year
- RedNote MCP - Xiaohongshu Content Search Tool☆24Jun 26, 2025Updated 10 months ago
- ☆39Oct 1, 2023Updated 2 years ago
- Theora video playback☆11Dec 20, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- AKShare 股票数据插件 是一个专为 Dify 平台开发的综合性股票数据工具,基于知名的 \[AKShare](https://github.com/akfamily/akshare) Python 库构建。本插件为用户提供了一站式的股票市场数据访问解决方案,涵盖实时行…☆85Nov 7, 2025Updated 5 months ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 9 months ago
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆13Sep 4, 2023Updated 2 years ago
- This is a multilabel classification layer for mxnet.☆12Apr 1, 2016Updated 10 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆112Apr 1, 2024Updated 2 years ago
- Win32 Differential Update Library☆14Dec 30, 2019Updated 6 years ago