A GUI program for speaker separation, built on FunASR
☆162Dec 14, 2025Updated 4 months ago
Alternatives and similar repositories for AudioSeparationGUI
Users that are interested in AudioSeparationGUI are comparing it to the libraries listed below.
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Dec 3, 2024Updated last year
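- A Webhook server for receiving WeChatPad message pushes, written in Python. It supports configuration hot reloading, signature verification, retries, and logging, and suits automation systems that need to integrate with WeChat messages.☆19Jun 27, 2025Updated 10 months ago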
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆540Oct 23, 2024Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆25Feb 12, 2026Updated 2 months ago
- Make audio books in one click! Let Genshin characters read novels for you!☆29Aug 2, 2024Updated last year
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆18Sep 27, 2024Updated last year
- Hoping to use code to paint a heart for waifus.☆95Updated this week
- A talking clock in Chinese for esp32 s3 Box with mp3 player and temperature reading☆12May 7, 2023Updated 2 years ago
- ☆19Jan 31, 2026Updated 3 months ago
- Script for Aegisub to cut video and audio files using subtitles☆35Oct 13, 2024Updated last year
- An enterprise-grade Chinese-English code-switch punctuator from funasr.☆33Apr 26, 2024Updated 2 years ago
- A curated collection of Veo3 AI video generation prompts, covering a variety of creative scenes and styles. Awesome Veo3 Prompts - A collection of 31 creative video generation prompts for Veo3 AI, featurin…☆62Aug 8, 2025Updated 8 months ago
- A xiaozhi-esp32 server for lightweight devices such as NAS units, routers, and Raspberry Pi boards☆38Apr 15, 2026Updated 3 weeks ago
- TalkX, an open-source AI large-model chat platform that supports programming plugins and connecting Xiaozhi devices.☆97Oct 16, 2025Updated 6 months ago
- ☆11Jan 6, 2025Updated last year
- An open-source toolkit for single- and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,939Mar 17, 2026Updated last month
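- The Xiaozhi ESP32 server configuration editor is a graphical tool designed for the xiaozhi-esp32-server project to simplify editing its configuration file. Through an intuitive interface, users can view and modify the settings in `data/.config.yaml` without editing the YAML file by hand. The Xiao…☆38Mar 13, 2025Updated last year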
- C++17 Neural Network (NN), Convolutional Neural Network (CNN) and Deep Learning for Esp32 on IDF from scratch☆23Aug 23, 2023Updated 2 years ago
- Implements MCP service functionality for the PC version of WeChat☆11Mar 22, 2025Updated last year
- Simple OBS whiteboard/telestrator plugin☆14Aug 1, 2025Updated 9 months ago
- Xiaozhi (小智) WebSocket protocol client, built with Flutter, supporting Android and iOS☆29Aug 12, 2025Updated 8 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Batch processor that enables large content to be digested by Ollama, focused on book processing and translation by default, fully, configu…☆36Oct 27, 2025Updated 6 months ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 8 months ago
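- A permission management system with separated front end and back end, based on Gin+Vue+ElementUI. The platform framework is developed on a stack of Golang, Gin, Xorm, Vue, ElementUI, and MySQL, with a complete RBAC permission architecture and basic core management modules. To shorten the development cycle, the framework integrates a code generator, with built-in platform custom…☆19May 19, 2022Updated 3 years ago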
- A simple module/way to use Perplexity AI in Python.☆13May 9, 2024Updated last year
- A WebSocket server modified from the official FunASR demo, paired with FastAPI to provide an HTTP service, allowing real-time ASR testing in the browser☆49Aug 4, 2025Updated 9 months ago
- A quick way to build a private large language model (LLM) server that provides OpenAI-compatible interfaces☆34Jan 6, 2024Updated 2 years ago
- Quickly extracts audio and video content and organizes it into a structured Markdown note☆2,041Jul 26, 2024Updated last year
- Inpaint anything using Segment Anything and inpainting models. Add visual selection of candidate points.☆10May 9, 2023Updated 2 years ago
- scikit-learn Machine Learning: Common Algorithm Principles and Programming Practice, by Huang Yongchang (黄永昌)☆13May 24, 2018Updated 7 years ago
- WhisperX FastAPI integration☆18Mar 31, 2024Updated 2 years ago
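- This project handles data preprocessing. The first step processes the collected audio, using Funasr timestamps to remove empty background audio. It also includes the labels used before feeding data to BERT☆16May 27, 2025Updated 11 months ago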
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 4 years ago
- A Stock Price prediction system using LLM and Multi-agent-system☆27Oct 24, 2023Updated 2 years ago
- LLaSA WebUI using ExLlamaV2 and FastAPI.☆28Mar 30, 2025Updated last year
- Auto-generated SWIG Python module with a binary component☆11Apr 19, 2012Updated 14 years ago
- Visual dialogue for Xiaozhi☆33Apr 25, 2025Updated last year