BeautyyuYanli / GPT-SoVITS-Infer
Developer-friendly inference code for RVC-Boss/GPT-SoVITS.
☆15Updated last year
Alternatives and similar repositories for GPT-SoVITS-Infer
Users who are interested in GPT-SoVITS-Infer are comparing it to the repositories listed below
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆56Updated 4 months ago
- ☆33Updated last year
- A unified interface for multiple Text-to-Speech (TTS) providers.☆273Updated 9 months ago
- Edit videos with a text editor☆37Updated 2 years ago
- ☆33Updated last year
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆27Updated last year
- Change☆57Updated 9 months ago
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆107Updated last year
- A telegram bot for OpenAI API☆30Updated 4 months ago
- coze api to openai☆15Updated last year
- AI no jimaku gumi (AIの字幕組), a subtitle maker for video using AI.☆192Updated 8 months ago
- Simple script to quickly implement DDNS based on CloudFlare.☆17Updated last month
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆123Updated 3 weeks ago
- An AI agent to control drones from your CLI☆134Updated 2 months ago
- Daily automation using Xiaomi human presence sensors☆36Updated 4 months ago
- Play "Bad Apple!!" in the psql client. This demo is to illustrate that the psql client can play some animation.☆48Updated 2 years ago
- A [real-time] AI voice assistant based on Funasr☆19Updated 4 months ago
- ☆24Updated 10 months ago
- Speech Enhancement Tools Based on Deep Learning☆124Updated 2 years ago
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 8 months ago
- ☆70Updated last year
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆96Updated last year
- ☆50Updated 3 weeks ago
- Self-hosted, container-based Whisper API system☆64Updated last year
- Speech Diarization for scrum automation☆111Updated 2 years ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 8 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆46Updated last year
- A class for generating realistic audio (TTS) for podcasts and dialogues.☆63Updated 9 months ago
- A voice assistant that runs completely on your local device.☆72Updated 2 months ago
- Big map for Google I/O 2025☆31Updated 4 months ago