用于SenseVoice的api项目,输出带时间戳字幕
☆42Oct 28, 2024Updated last year
Alternatives and similar repositories for sense-api
Users that are interested in sense-api are comparing it to the libraries listed below
Sorting:
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 用于kokoro TTS的webui界面和兼容openai api☆39Feb 4, 2025Updated last year
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year
- 一个用于F5-TTS的api和webui项目☆64Dec 25, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- human in the loop in dify workflow by plugin☆14Jan 7, 2025Updated last year
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- MT3:多任务多音轨音乐转录的 Gradio 演示。(全中文汉化)☆12Mar 24, 2025Updated 11 months ago
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆41Sep 23, 2024Updated last year
- 基于FastAPI的语音服务系统,集成语音合成(TTS)和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆20Apr 1, 2025Updated 11 months ago
- 一个用于CosyVoice的api接口项目☆336Aug 31, 2025Updated 6 months ago
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆28Sep 1, 2025Updated 6 months ago
- ChatTTS is a generative speech model for daily dialogue.☆14Oct 21, 2024Updated last year
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆20May 24, 2024Updated last year
- ☆11Feb 25, 2026Updated last week
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- 一个中文语音转文字项目,封装自FireRedASR☆83Feb 24, 2025Updated last year
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆92Sep 5, 2024Updated last year
- 通过LLM进行进行字幕断句分割,处理和优化字幕文件,将自动语音识别(ASR)数据的分段合并与拆分,☆138Dec 17, 2024Updated last year
- 与rime联动的跨平台离线语音输入法☆22Nov 11, 2025Updated 3 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- IMAGdressing在Windows环境下运行的webui界面☆22Jul 25, 2024Updated last year
- Protective hooks for Claude Code that prevent accidental code loss through branch protection, automatic checkpointing, and safe commit …☆48Sep 15, 2025Updated 5 months ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 6 months ago
- Extracting time features from text using a Finite State Transducer (FST) in Python☆53Dec 1, 2025Updated 3 months ago
- ComfyUI implementation of FlashFace: Human Image Personalization with High-fidelity Identity Preservation☆26Jul 31, 2024Updated last year
- ☆29Nov 10, 2025Updated 3 months ago
- ☆69Jul 17, 2024Updated last year
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 9 months ago
- An effortless way to convert your python file to exe file in GUI. You can select your own python environment for the conversion.☆10May 10, 2023Updated 2 years ago
- 使用yolov10目标检测模型进行电路板缺陷检测 | Using yolov10 for circuit board (PCB) defect detection☆49Dec 14, 2025Updated 2 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- Port of Funasr's Sense-voice model in C/C++☆522Dec 19, 2025Updated 2 months ago
- ☆39Jan 20, 2025Updated last year
- 使用SG2300X实现无瑕疵换脸☆33Sep 2, 2024Updated last year
- Colab notebooks for Next-gen Kaldi☆30Oct 12, 2025Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆36Apr 25, 2025Updated 10 months ago