这是一款基于FunASR实现的说话人分离的GUI程序
☆160Dec 14, 2025Updated 3 months ago
Alternatives and similar repositories for AudioSeparationGUI
Users that are interested in AudioSeparationGUI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Dec 3, 2024Updated last year
- About 一個用於接收 WeChatPad 訊息推送的 Webhook 服務端,使用 Python 編寫,支援配置熱加載、簽名驗證、重試機制和日誌記錄,適用於需要對接微信訊息的自動化系統。☆18Jun 27, 2025Updated 9 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆539Oct 23, 2024Updated last year
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 不会聊天的字幕提取器不是一个好 B 站下载器~☆92Updated this week
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆3,993Aug 14, 2025Updated 7 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆39Aug 13, 2024Updated last year
- A talking clock in Chinese for esp32 s3 Box with mp3 player and temperature reading☆12May 7, 2023Updated 2 years ago
- ☆17Jan 31, 2026Updated last month
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆32Apr 26, 2024Updated last year
- zero-shot voice conversion & singing voice conversion, with real-time support☆11Feb 11, 2025Updated last year
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- 一个精选的Veo3 AI视频生成提示词集合,包含各种创意场景和风格的视频提示词。Awesome Veo3 Prompts - A collection of 31 creative video generation prompts for Veo3 AI, featurin…☆54Aug 8, 2025Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14Nov 17, 2024Updated last year
- Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P…☆14Feb 2, 2020Updated 6 years ago
- TalkX,一个开源的AI大模型聊天平台,支持编程插件、小智设备连接使用。☆95Oct 16, 2025Updated 5 months ago
- ☆11Jan 6, 2025Updated last year
- A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain☆11Mar 13, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Autonomous Driving W/ Deep Reinforcement Learning in Lane Keeping - DDQN and SAC with kinematics/birdview-images☆13Jul 17, 2024Updated last year
- Solution for CarRacing-v0 environment from OpenAI Gym. It uses the Deep Deterministic Policy Gradient algorithm.☆12Nov 18, 2022Updated 3 years ago
- 小智ESP32服务端配置编辑器是一个图形化工具,专为 xiaozhi-esp32-server 项目设计,旨在简化配置文件的编辑过程。通过直观的界面,用户可以轻松查看和修改 `data/.config.yaml` 文件中的各项配置,无需手动编辑YAML文件。The Xiao…☆36Mar 13, 2025Updated last year
- C++17 Neural Network (NN), Convolutional Neural Network (CNN) and Deep Learning for Esp32 on IDF from scratch☆23Aug 23, 2023Updated 2 years ago
- 实现pc端微信的mcp服务功能☆11Mar 22, 2025Updated last year
- Xiaozhi/小智Websocket协议客户端,基于Flutter开发,支持Android和iOS平台☆26Aug 12, 2025Updated 7 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- 本意是想做一个直接调用kimi的API帮我读论文的程序,然后发现API太贵了,但kimi网页版免费,就结合chrome和python写了这么个东西☆12Mar 7, 2024Updated 2 years ago
- Batch processor to enable large content be digested by Ollama, focused around book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Cocktail party problem solution using deep learning☆16Jan 26, 2018Updated 8 years ago
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,851Dec 8, 2025Updated 3 months ago
- 基于FunASR官方Demo修改的WS服务端,配合FastAPI提供HTTP服务,可以在浏览器中进行实时ASR测试☆48Aug 4, 2025Updated 7 months ago
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆2,005Jul 26, 2024Updated last year
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 10 months ago
- scikit-learn机器学习 常用算法原理及编程实战 黄永昌编著☆13May 24, 2018Updated 7 years ago
- WhisperX FastAPI integration☆18Mar 31, 2024Updated last year