Soul-AILab / SoulX-Podcast
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
☆592Updated this week
Alternatives and similar repositories for SoulX-Podcast
Users who are interested in SoulX-Podcast are comparing it to the repositories listed below.
- ☆166Updated 11 months ago
- GPT-4o-level, real-time spoken dialogue system.☆359Updated 9 months ago
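- Intelligent, automated generation of children's audiobooks: uses the Tongyi Qianwen LLM, CosyVoice speech synthesis, Flux image generation, and Paraformer speech recognition to produce production-ready children's audiobooks.☆102Updated last month
- Super-fast LLM responses, using a small LLM to prefix the output of a large LLM.☆232Updated 3 months ago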
- ☆466Updated 5 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆379Updated 3 months ago
- ☆285Updated last year
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆320Updated 8 months ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆986Updated last month
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆456Updated 11 months ago
- Expression and animation testing based on Bert-VITS2.☆537Updated 2 months ago
- A face-sculpting tool built on Nano Banana! Craft your perfect portrait. Use the control options to let AI turn your imagination into reality.☆329Updated last month
- from Google AI Studio☆140Updated last month
- Trans Router☆166Updated 9 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆157Updated 7 months ago
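- Banana Shop (香蕉铺子): a Nano Banana creative board. No prompts needed; it includes a built-in collection of my creative techniques for easily building a creative system.☆248Updated last month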
- A complete video subtitle editing React component with AI-powered speech recognition and visual editing capabilities.☆791Updated 2 weeks ago
- AI video editing☆256Updated 2 months ago
- Banana Factory (香蕉工厂): easy workflow automation based on the Nano Banana and Veo3 models/tools.☆243Updated last month
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆350Updated 2 months ago
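- ⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. No need to purchase the Whisper API: it runs inference with a locally hosted Whisper model, supports multi-GPU concurrency, and is designed for distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing from multiple social platforms,…☆424Updated 4 months ago
- VoiceCanvas: a text-to-speech system with Stripe payment support, voice cloning, 50+ languages, and selectable voices. The code is 100% open source.☆398Updated 2 months ago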
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆648Updated 3 months ago
- A free, open-source alternative to Wispr Flow | A next-generation Chinese desktop voice workflow integrating local FunASR models and configurable large language models.☆1,591Updated 3 weeks ago
- The world's fastest Claude Code launcher. Makes launching Claude Code simpler: click the Dock to launch, or start from any folder instantly.☆356Updated last week
- Fogsight is an AI agent and animation engine powered by Large Language Models.☆1,280Updated 2 months ago
- ☆601Updated last year
- A tool for making comics with AI, supporting script writing, storyboard design, and character style control.☆623Updated last month
- A unified interface for multiple Text-to-Speech (TTS) providers.☆275Updated 9 months ago
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆129Updated last month