jianchang512/sense-api

An API project for SenseVoice that outputs subtitles with timestamps.

☆49

Alternatives and similar repositories for sense-api

Users that are interested in sense-api are comparing it to the libraries listed below.

jianchang512 / kokoro-uiapi
A web UI and OpenAI-compatible API for kokoro TTS.
☆41 · Feb 4, 2025 · Updated last year
jundaychan / funasr-fastapi
A simple API version of funasr speech-to-text (funasr + fastapi), easy to deploy on a server.
☆13 · Aug 10, 2024 · Updated last year
hmjz100 / MT3
MT3: a Gradio demo of multi-task, multi-track music transcription (fully localized into Chinese).
☆12 · Mar 24, 2025 · Updated last year
CrispStrobe / Susurrus
speech to text gui for different (e.g. Whisper, Voxtral) models and backends, including whisper.cpp, crispasar, mlx-whisper, faster-whisp…
☆27 · Updated this week
jianchang512 / cosyvoice-api
An API project for CosyVoice.
☆333 · Aug 31, 2025 · Updated 10 months ago
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15 · Dec 23, 2024 · Updated last year
davideuler / local-whisper-input
local whisper input by Whisper or SenseVoice/FunASR
☆22 · Mar 5, 2025 · Updated last year
jianchang512 / speech2text-df
An API and web UI, based on the Dolphin model, for converting audio and video in Eastern languages to subtitles.
☆19 · Apr 3, 2025 · Updated last year
jianchang512 / remove-noise
A simple audio noise-reduction tool that provides a web UI and an API.
☆46 · Nov 21, 2024 · Updated last year
Anvarjon / Age-Gender-Classification
Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…
☆28 · Mar 5, 2024 · Updated 2 years ago
dengcunqin / noise-reduction
noise reduction
☆17 · Jul 3, 2024 · Updated 2 years ago
jianchang512 / f5-tts-api
An API and web UI project for F5-TTS.
☆63 · Dec 25, 2024 · Updated last year
pengzhendong / ngram-punctuator
An N-gram punctuator for Chinese and English.
☆18 · Oct 14, 2025 · Updated 9 months ago
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆20 · Jul 6, 2025 · Updated last year
pengzhendong / audio-pipeline
☆23 · Oct 17, 2024 · Updated last year
colaudiolab / AudioSet-R
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19 · Oct 9, 2025 · Updated 9 months ago
RemSynch / SenseVoice-Real-Time
A small project that combines VAD, a voiceprint lock, and SenseVoice for a simple implementation of near-real-time speech transcription.
☆42 · Sep 23, 2024 · Updated last year
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI + sherpa-onnx
☆222 · Nov 2, 2025 · Updated 8 months ago
hegugu-ng / DDTV_WEBUI
A web UI built on the open API provided by ddtv, developed with Vue 3.0.
☆12 · Apr 3, 2023 · Updated 3 years ago
kodeleung / CosyVoice2
A modification of the official CosyVoice, with the overall interaction adapted to the CosyVoice2 model; works out of the box.
☆23 · Jun 15, 2025 · Updated last year
jianchang512 / fireredasr-ui
A Chinese speech-to-text project that wraps FireRedASR.
☆88 · Feb 24, 2025 · Updated last year
nuanarchy / ComfyUI-NuA-FlashFace
ComfyUI implementation of FlashFace: Human Image Personalization with High-fidelity Identity Preservation
☆26 · Jul 31, 2024 · Updated last year
LuckLittleBoy / SenseVoice-OneApi
An API release based on the funasr version of SenseVoice that integrates seamlessly with oneapi.
☆92 · Sep 5, 2024 · Updated last year
jkfujr / RecStatus
A third-party management panel for stream recorders such as 录播姬 and BLREC.
☆22 · Jun 27, 2026 · Updated 3 weeks ago
LAION-AI / scaled-echo-tts
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24 · Mar 29, 2026 · Updated 3 months ago
Mddct / simple-tts
(WIP) long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
Mddct / cosyvoice2-flow-optimized
faster inference
☆27 · Jan 20, 2025 · Updated last year
Wasser1462 / FunASR-nano-onnx
A lightweight demo of FunASR-Nano using ONNX runtime.
☆83 · Feb 25, 2026 · Updated 5 months ago
zaigie / FunSpeech
An out-of-the-box speech service for local, private deployment; quickly sets up Qwen3ASR/FunASR and Qwen3TTS/CosyVoice backends.
☆153 · Jul 6, 2026 · Updated 2 weeks ago
AIFSH / NativeSpeakerUI
☆40 · Feb 28, 2024 · Updated 2 years ago
jkfujr / BilibiliCookieMgmt
BILIBILI COOKIE manager
☆16 · Updated this week
lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
☆898 · Dec 10, 2025 · Updated 7 months ago
Mddct / transformer-vocos
☆35 · Sep 6, 2025 · Updated 10 months ago
JuliaMusic / PianoHands.jl
(Experimental) Predicting hand assignments in piano MIDI using neural networks
☆13 · Oct 11, 2024 · Updated last year
Ranqumn / FoE_pink_eyes_publish
A re-typeset edition of 《辐射小马国：粉色双眸》 (Fallout: Equestria - Pink Eyes).
☆12 · Oct 11, 2019 · Updated 6 years ago
pngwn / gradio-imageslider
ImageSlider custom component for gradio.
☆43 · May 20, 2024 · Updated 2 years ago
scottishfold0621 / ACMID
☆26 · Apr 30, 2026 · Updated 2 months ago
Hong-01 / Python-File-to-EXE-File-Converter
An effortless way to convert your Python file to an EXE file in a GUI. You can select your own Python environment for the conversion.
☆10 · May 10, 2023 · Updated 3 years ago
primepake / dac_vae
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38 · Aug 30, 2025 · Updated 10 months ago