jianchang512 / sense-apiView external linksLinks
用于SenseVoice的api项目,输出带时间戳字幕
☆43Oct 28, 2024Updated last year
Alternatives and similar repositories for sense-api
Users that are interested in sense-api are comparing it to the libraries listed below
Sorting:
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year
- 一个用于F5-TTS的api和webui项目☆65Dec 25, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- human in the loop in dify workflow by plugin☆14Jan 7, 2025Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- 一个用于CosyVoice的api接口项目☆335Aug 31, 2025Updated 5 months ago
- 使用 Gemini AI 转写音视频为 SRT 字幕☆54Jan 11, 2025Updated last year
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆29Sep 1, 2025Updated 5 months ago
- 基于Dolphin模型的东方语言音视频转字幕api及webui☆19Apr 3, 2025Updated 10 months ago
- ChatTTS is a generative speech model for daily dialogue.☆14Oct 21, 2024Updated last year
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆20May 24, 2024Updated last year
- ☆11Feb 6, 2026Updated last week
- 一个中文语音转文字项目,封装自FireRedASR☆84Feb 24, 2025Updated 11 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆92Sep 5, 2024Updated last year
- 与rime联动的跨平台离线语音输入法☆22Nov 11, 2025Updated 3 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- IMAGdressing在Windows环境下运行的webui界面☆22Jul 25, 2024Updated last year
- Real time faster whisper gradio☆25Aug 17, 2025Updated 5 months ago
- ☆29Nov 10, 2025Updated 3 months ago
- ☆69Jul 17, 2024Updated last year
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 9 months ago
- An effortless way to convert your python file to exe file in GUI. You can select your own python environment for the conversion.☆10May 10, 2023Updated 2 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- Port of Funasr's Sense-voice model in C/C++☆514Dec 19, 2025Updated last month
- faster inference☆28Jan 20, 2025Updated last year
- 使用SG2300X实现无瑕疵换脸☆33Sep 2, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆36Apr 25, 2025Updated 9 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- 基于 IMO25 的 Deep Think Agent,拥有强大的逻辑和指令遵循能力☆86Oct 31, 2025Updated 3 months ago
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere…☆38Apr 5, 2024Updated last year
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 8 months ago
- Workflow automation, but you just describe what you want and it happens.☆26Nov 22, 2025Updated 2 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny 300M model!☆84Jan 29, 2026Updated 2 weeks ago