XiaomingX / awesome-ai-audio-startupsLinks
这是一个专注于音频和音乐技术领域的 AI 创业公司社区列表。这个项目汇集了那些利用人工智能推动音频和音乐创新的公司,致力于通过 AI 技术在音频生成、音频增强、音乐创作、音频分析等方面推动行业的前沿发展。无论你是对音频技术创新感兴趣的开发者、创业者,还是投资者或音乐爱好者,这个列表都将是你了解 AI 音频创业生态系统的宝贵资源。
☆33Updated 5 months ago
Alternatives and similar repositories for awesome-ai-audio-startups
Users that are interested in awesome-ai-audio-startups are comparing it to the libraries listed below
Sorting:
- 用于SenseVoice的api项目,输出带时间戳字幕☆42Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- A list of open-source AI projects you can use to generate income easily.☆115Updated this week
- 一个开源的,现代设计的LLMS/人工智能聊天框架。支持多 人工智能供应商(OpenAI/Claude 3/Gemini/Ollama/Bedrock/Azure/Mistral/Conspirity),多模态(Vision/TTS)和插件系统。一键免费部署您的私人ChatGP…☆23Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated last month
- ☆132Updated 3 weeks ago
- 将任意人的音色转换为成千上万种不同音色☆32Updated 2 years ago
- ☆70Updated last year
- 根据音乐节奏自动进行视频卡点剪辑☆17Updated 4 years ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 7 months ago
- 语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译☆80Updated 10 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 9 months ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆55Updated 9 months ago
- so-vits-svc as a TuneFlow plugin☆52Updated 2 years ago
- Translated vocal synthesis - Clone a voice and output speech in another language☆26Updated 3 years ago
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆25Updated 3 months ago
- trying to reproduce suno v3☆34Updated 9 months ago
- ultimate vocal remover application run on linux ubuntu1604☆54Updated 2 years ago
- generate lyrics, song and background music(instrumental). Model Context Protocol (MCP) server.☆57Updated 5 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆81Updated 3 months ago
- Sora 中文的提示词 | 短视频提示词(prompt)技巧 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。☆86Updated this week
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database.☆22Updated 4 years ago
- Cantonese Text to Speech with VITS implementation☆36Updated 2 years ago
- deepseek可以用来做什么?这个项目给出的答案☆20Updated 5 months ago
- Dify DSL collection收集Dify工作流文件DSL,这里很多文件并不是本人原创,而是收集而来,感谢原作者。目前我是初学github,后面会加入大量原创内容☆20Updated 3 months ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆16Updated 2 years ago
- chat backend with GPT3/chatGPT and multilingual VITS, and multilingual speech input supported☆12Updated 2 years ago
- 完全依靠ChatGPT生成数据微调的西式翻译腔聊天风格中文大模型☆19Updated last year
- Project of Singing Voice Conversion.☆15Updated 2 years ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆42Updated 7 months ago