Convenient for developers to call inference models from version v1 to v3 through API, supporting streaming transmission and specified type file transfer.
☆43Mar 4, 2025Updated last year
Alternatives and similar repositories for GPT-SoVITS-V3-Infer-API
Users that are interested in GPT-SoVITS-V3-Infer-API are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GPT-SoVITS api for v3 version☆14Mar 5, 2025Updated last year
- UE5.4 Demo project for LocalLLM plugin☆24Oct 23, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- ☆13Nov 15, 2021Updated 4 years ago
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows☆284Jan 8, 2026Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Chinese character recognition☆10Oct 27, 2020Updated 5 years ago
- 多进程 GFPGAN,提高运行效率和资源利用,根据设备不同提高数倍速度☆13Oct 31, 2023Updated 2 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆10Feb 6, 2024Updated 2 years ago
- ☆17May 14, 2024Updated 2 years ago
- ☆17Jun 3, 2020Updated 6 years ago
- 自制原神技能冷却监控插件-原神云端挂机签到-水龙王转圈圈鼠标宏-原神技能冷却计时-原神云砍树☆18Dec 11, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks.☆17Jan 18, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Compute benchmark of table structure recognition.☆29Dec 2, 2025Updated 6 months ago
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆28Jan 20, 2024Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆44Jun 13, 2024Updated 2 years ago
- 日本語TTS(VITS)の学習と音声合成のGradio WebUI☆42Jan 5, 2024Updated 2 years ago
- 基于GptSoVits项目的参考音频筛选工具☆26Aug 17, 2025Updated 9 months ago
- Python 2/3 compatible .npz CIFAR-10 dataset☆10Mar 1, 2017Updated 9 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆22Jul 5, 2023Updated 2 years ago
- text to speech using autoregressive transformer and VITS☆249Apr 3, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Feb 20, 2025Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆16Oct 27, 2023Updated 2 years ago
- CNTK implementation of Fully Convolutional Networks (FCN) with ResNet for semantic segmentation☆12Aug 18, 2017Updated 8 years ago
- YuE with mp3 extend, exllama and GUI☆65Feb 24, 2025Updated last year
- ☆60Oct 22, 2025Updated 7 months ago
- 印章目标检测以及印章内信息提取。☆18Dec 6, 2023Updated 2 years ago
- RedNote MCP - Xiaohongshu Content Search Tool☆24Jun 26, 2025Updated 11 months ago
- ☆39Oct 1, 2023Updated 2 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆13Sep 4, 2023Updated 2 years ago
- This is a multilabel classification layer for mxnet.☆12Apr 1, 2016Updated 10 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆60Apr 4, 2024Updated 2 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆113Apr 1, 2024Updated 2 years ago
- Google word2vec tools built for windows compiled with visual studio 2017 and dev c++ on Windows 10 x64.☆15Jun 9, 2017Updated 9 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆19May 13, 2019Updated 7 years ago