Paraformer (Chinese ASR) online ONNX runtime for Python
☆53Mar 27, 2024Updated last year
Alternatives and similar repositories for paraformer-python
Users that are interested in paraformer-python are comparing it to the libraries listed below
- Paraformer web server built with Sanic☆28May 3, 2023Updated 2 years ago
- Port of FunASR's Paraformer model in C/C++☆39Jun 19, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- An open-source toolkit for single- and multi-modal speaker verification from ModelScope and FunASR with ONNX☆15Dec 16, 2023Updated 2 years ago
- A simple API version of FunASR speech-to-text (FunASR + FastAPI), easy to deploy on a server☆13Aug 10, 2024Updated last year
- Audio tools for Python☆16Dec 5, 2025Updated 2 months ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- An enterprise-grade voice activity detector from ModelScope and FunASR.☆128Apr 26, 2023Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- Fixes the bug in FunASR where SeACo-Paraformer has no timestamps after export to ONNX☆24Sep 12, 2024Updated last year
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, running on onnxruntime☆109Oct 6, 2025Updated 4 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- A speaker-diarization-based speech recognition API implemented using FunASR.☆20Feb 12, 2026Updated 2 weeks ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- For personal use: speech-to-text uses SenseVoice, the LLM part is based on Ollama API calls, text-to-speech uses CosyVoice, and real-time voice input follows https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year
- A CTC decoder for both online and offline ASR models☆66Nov 18, 2023Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- WeNet online decode demo☆31Apr 28, 2021Updated 4 years ago
- (WIP) Long-form speech generation☆31Apr 2, 2025Updated 10 months ago
- An enterprise-grade Chinese-English code-switching punctuator from FunASR.☆30Apr 26, 2024Updated last year
- ☆12Jul 11, 2024Updated last year
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆104Mar 30, 2025Updated 11 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 3 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription☆41Sep 23, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 8 months ago
- Whatsapp Gateway For Delphi☆12Aug 18, 2021Updated 4 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- Pseudo Streaming SenseVoice with Hotwords☆428Mar 13, 2025Updated 11 months ago
- ☆15Aug 25, 2022Updated 3 years ago
- A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual sound phoneme) classification☆14Jan 19, 2021Updated 5 years ago
- Compute WER and SER for speech recognition evaluation☆26Dec 15, 2025Updated 2 months ago