lukeewin/AudioSeparationGUI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lukeewin/AudioSeparationGUI)

lukeewin / AudioSeparationGUI

这是一款基于FunASR实现的说话人分离的GUI程序

☆163

Alternatives and similar repositories for AudioSeparationGUI

Users that are interested in AudioSeparationGUI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lukeewin / FunASR_API
View on GitHub
这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.
☆27Jun 16, 2026Updated last month
0x5446 / api4sensevoice
View on GitHub
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆538Oct 23, 2024Updated last year
v3ucn / ASR_TOOLS_SenseVoice_WebUI
View on GitHub
Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型
☆182Jul 10, 2024Updated 2 years ago
ruzhila / voiceapi
View on GitHub
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
☆222Nov 2, 2025Updated 8 months ago
Ikaros-521 / FunASR_WS
View on GitHub
基于FunASR官方Demo修改的WS服务端，配合FastAPI提供HTTP服务，可以在浏览器中进行实时ASR测试
☆55Aug 4, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
modelscope / ClearerVoice-Studio
View on GitHub
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,330Aug 14, 2025Updated 11 months ago
Arthurzhangsheng / echomimic-all-in-one-package
View on GitHub
echomimic免环境安装windows一体包，解压即用|echomimic environment-free installation Windows all-in-one package, ready to use after extraction
☆20Aug 26, 2024Updated last year
XnneHangLab / XnneHangLab
View on GitHub
希望用代码为 waifus 绘心。
☆99Updated this week
bhavika / JoyDivision
View on GitHub
Music Mood Classification on the Million Song Dataset
☆17Jul 27, 2019Updated 6 years ago
lrxwisdom001 / GPT-SoVITS-Novels
View on GitHub
Make audio books in one click! Let Genshin characters read novels for you!
☆29Aug 2, 2024Updated last year
harry0703 / AudioNotes
View on GitHub
快速提取音视频内容，整理成一份结构化的markdown笔记
☆2,222Updated this week
lukeewin / faster_whisper_streaming
View on GitHub
This is a project focused on Faster Whisper, a streaming speech recognition project.
☆18Sep 27, 2024Updated last year
pengzhendong / pysilero
View on GitHub
Python Wrapper of Silero VAD
☆63May 8, 2025Updated last year
BiboyQG / bob-cosyvoice
View on GitHub
A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.
☆39Aug 13, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
liu-qingyuan / faster_whisper_gradio
View on GitHub
Real time faster whisper gradio
☆24Aug 17, 2025Updated 11 months ago
big-mouth-cn / xiaozhi-server4j
View on GitHub
小智机器人服务端
☆18Mar 25, 2025Updated last year
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
XuSenfeng / xiaozhi-server-vision
View on GitHub
小智的视觉对话
☆34Apr 25, 2025Updated last year
pengzhendong / streaming-sensevoice
View on GitHub
Pseudo Streaming SenseVoice with Hotwords
☆467Jun 15, 2026Updated last month
big-mouth-cn / talkx
View on GitHub
TalkX，一个开源的AI大模型聊天平台，支持编程插件、小智设备连接使用。
☆98Oct 16, 2025Updated 9 months ago
haithanhp / mixconv_pytorch
View on GitHub
☆12Aug 23, 2019Updated 6 years ago
tumuyan / video-shuffler-for-aegisub
View on GitHub
Script for Aegisub to cut video and voice files | 在Aegisub中用字幕切割视频和音频文件
☆35Oct 13, 2024Updated last year
ailia-ai / onnx-quantization
View on GitHub
Example of onnx quantization
☆11Feb 8, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Eric-coder / MayabatchExportAbc
View on GitHub
Maya后台批量导出abc缓存文件
☆11Sep 17, 2020Updated 5 years ago
modelscope / 3D-Speaker
View on GitHub
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,070Dec 8, 2025Updated 7 months ago
kodiful / plugin.video.tver
View on GitHub
☆12May 6, 2026Updated 2 months ago
findstr / xiaozhi-esp32-server-mini
View on GitHub
适用于 NAS、路由器、树莓派等轻量级设备的 xiaozhi-esp32 服务端
☆41May 7, 2026Updated 2 months ago
yangdongchao / Target-sound-event-detection
View on GitHub
The source code for target sound detection
☆15Feb 26, 2022Updated 4 years ago
lihui600 / CNN-use-C-achieve
View on GitHub
LeNet-5 use c achieve
☆13Jan 10, 2020Updated 6 years ago
ReLuckyLucy / Another_Me
View on GitHub
基于roop与codeFormer的换脸一体脚本
☆21Apr 9, 2025Updated last year
LuckLittleBoy / SenseVoice-OneApi
View on GitHub
基于SenseVoice的funasr版本进行的api发布，可以无缝对接oneapi
☆92Sep 5, 2024Updated last year
RapidAI / RapidASR
View on GitHub
📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …
☆608May 15, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RoversCode / seed-vc
View on GitHub
zero-shot voice conversion & singing voice conversion, with real-time support
☆11Feb 11, 2025Updated last year
jianchang512 / sense-api
View on GitHub
用于SenseVoice的api项目，输出带时间戳字幕
☆49Oct 28, 2024Updated last year
SNTube / Streaming-Captions
View on GitHub
基于Streaming-SenseVoice项目的伪流式实时字幕界面
☆13Apr 15, 2025Updated last year
chen97 / SRT2PRXML
View on GitHub
Online tool to convert the subtitle file (SRT) to PremierePro-supported XML format.
☆12Nov 28, 2023Updated 2 years ago
Ikaros-521 / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14Nov 17, 2024Updated last year
Asstar-X / AsLive
View on GitHub
一个实时交互的语音项目
☆52May 20, 2026Updated 2 months ago
zhangnn520 / znn_chatglm
View on GitHub
打造人人都会的NLP，开源不易，记得star哦
☆101Apr 28, 2023Updated 3 years ago