XiaomingX / awesome-ai-audio-startups

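This is a community list of AI startups focused on audio and music technology. The project brings together companies that use artificial intelligence to drive innovation in audio and music, advancing the state of the art in audio generation, audio enhancement, music creation, and audio analysis. Whether you are a developer or entrepreneur interested in audio technology innovation, or an investor or music enthusiast, this list is a useful resource for understanding the AI audio startup ecosystem.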

☆33

Alternatives and similar repositories for awesome-ai-audio-startups

Users who are interested in awesome-ai-audio-startups are comparing it to the repositories listed below.

jianchang512 / sense-api
An API project for SenseVoice that outputs subtitles with timestamps
☆42 Updated last year
pnkvalavala / multivoice
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …
☆26 Updated 2 years ago
garylab / MakeMoneyWithAI
A list of open-source AI projects you can use to generate income easily.
☆115 Updated this week
find-xposed-magisk / lobe-chat
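An open-source, modern-design LLM/AI chat framework. Supports multiple AI providers (OpenAI/Claude 3/Gemini/Ollama/Bedrock/Azure/Mistral/Conspirity), multimodal features (Vision/TTS), and a plugin system. One-click free deployment of your private ChatGP…
☆23 Updated this week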
lovemefan / SenseVoice-python
SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, running on onnxruntime
☆106 Updated last month
fishaudio / fish-audio-python
☆132 Updated 3 weeks ago
ishine / vc-lm
Convert anyone's voice timbre into thousands of different timbres
☆32 Updated 2 years ago
NZqian / RapBank
☆70 Updated last year
Meteor8 / beats-synced-video-generator
Automatically edits video cuts in sync with the beat of the music
☆17 Updated 4 years ago
MiniMax-AI / audio-tools
A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…
☆39 Updated 7 months ago
zmeet-ai / asr_demo
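Speech recognition API offering real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries
☆80 Updated 10 months ago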
warmshao / ChatTTSPlus
Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment
☆172 Updated 9 months ago
ABexit / Multi-Character-StoryTeller
This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…
☆55 Updated 9 months ago
tuneflow / so-vits-svc-plugin
so-vits-svc as a TuneFlow plugin
☆52 Updated 2 years ago
andrew-fennell / CogNative
Translated vocal synthesis - Clone a voice and output speech in another language
☆26 Updated 3 years ago
Atotti / miipher-2
Training and inference code for a reproduction of Miipher-2, Google's speech restoration model. Pretrained models are also released.
☆25 Updated 3 months ago
multimodal-art-projection / Open-Suno
trying to reproduce suno v3
☆34 Updated 9 months ago
NanKeRen2020 / UVR5_Linux
Ultimate Vocal Remover application running on Linux (Ubuntu 16.04)
☆54 Updated 2 years ago
SkyworkAI / Mureka-mcp
Generate lyrics, songs, and background music (instrumental). Model Context Protocol (MCP) server.
☆57 Updated 5 months ago
ETomberg391 / Ecne-AI-Podcaster
AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast
☆81 Updated 3 months ago
SoraEase / sora-prompt-zh
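Sora prompts in Chinese | Short-video prompt techniques | Tuning guide. Usage guides for a variety of scenarios. Learn how to make it do what you ask. Also covers Sora's multi-scenario applications.
☆86 Updated this week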
leonardltk / Shazam-An-Industrial-Strength-Audio-Search-Algorithm-
Detects which song in the database an audio segment belongs to, and returns Nil if the song does not exist in the database.
☆22 Updated 4 years ago
Keith-Hon / vits-cantonese
Cantonese Text to Speech with VITS implementation
☆36 Updated 2 years ago
XiaomingX / awesome-deepseek
What can DeepSeek be used for? This project offers an answer.
☆20 Updated 5 months ago
Paulzhang2023 / Dify-DSL-collection
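Dify DSL collection: a collection of Dify workflow DSL files. Many of the files here are not my own work but were collected from others; thanks to the original authors. I am currently new to GitHub and will add a lot of original content later.
☆20 Updated 3 months ago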
jxaizj / Modify-Anything
Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…
☆16 Updated 2 years ago
lrioxh / backend-with-gpt-vits
chat backend with GPT3/chatGPT and multilingual VITS, and multilingual speech input supported
☆12 Updated 2 years ago
Benson114 / Translational-Style-ChatLLM
A Chinese large language model with a Western "translationese" chat style, fine-tuned entirely on ChatGPT-generated data
☆19 Updated last year
SLPcourse / Singing-Voice-Conversion
Project of Singing Voice Conversion.
☆15 Updated 2 years ago
c4fun / tell-stories-webui
Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play
☆42 Updated 7 months ago