A GUI program for speaker separation, built on FunASR.
☆162 · Dec 14, 2025 · Updated 6 months ago
Alternatives and similar repositories for AudioSeparationGUI
Users that are interested in AudioSeparationGUI are comparing it to the libraries listed below.
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS. ☆25 · Dec 3, 2024 · Updated last year
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,… ☆541 · Oct 23, 2024 · Updated last year
- A standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba's FunAsr, Bcut (必剪) Asr, and the Whisper large model. ☆181 · Jul 10, 2024 · Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR. ☆26 · Feb 12, 2026 · Updated 4 months ago
- Make audio books in one click! Let Genshin characters read novels for you! ☆29 · Aug 2, 2024 · Updated last year
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe… ☆4,233 · Aug 14, 2025 · Updated 10 months ago
- Hoping to paint a heart for waifus with code. ☆96 · Jun 7, 2026 · Updated last week
- A Bob plugin that calls a self-deployed Cosyvoice service to achieve TTS. ☆39 · Aug 13, 2024 · Updated last year
- A talking clock in Chinese for esp32 s3 Box with mp3 player and temperature reading ☆12 · May 7, 2023 · Updated 3 years ago
- ☆19 · Jan 31, 2026 · Updated 4 months ago
- Script for Aegisub to cut video and voice files using subtitles ☆35 · Oct 13, 2024 · Updated last year
- An enterprise-grade Chinese-English code-switching punctuator from funasr. ☆33 · Apr 26, 2024 · Updated 2 years ago
- zero-shot voice conversion & singing voice conversion, with real-time support ☆11 · Feb 11, 2025 · Updated last year
- Python Wrapper of Silero VAD ☆63 · May 8, 2025 · Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr… ☆12 · Dec 19, 2025 · Updated 5 months ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆15 · Jun 11, 2024 · Updated 2 years ago
- A xiaozhi-esp32 server for lightweight devices such as NAS boxes, routers, and Raspberry Pi ☆40 · May 7, 2026 · Updated last month
- Helps visually impaired people identify objects and alerts them to obstacles ☆98 · Mar 14, 2025 · Updated last year
- A curated collection of Veo3 AI video generation prompts covering a variety of creative scenes and styles. Awesome Veo3 Prompts - A collection of 31 creative video generation prompts for Veo3 AI, featurin… ☆71 · Aug 8, 2025 · Updated 10 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆14 · Nov 17, 2024 · Updated last year
- Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P… ☆14 · Feb 2, 2020 · Updated 6 years ago
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni… ☆18 · Oct 19, 2020 · Updated 5 years ago
- TalkX, an open-source AI large-model chat platform that supports programming plugins and connecting Xiaozhi devices.