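Python-based speech recognition service deployment. Any ASR model interface that supports single-utterance decoding can follow this framework to deploy its own speech recognition service.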
☆55Mar 8, 2022Updated 4 years ago
Alternatives and similar repositories for ASR_python_deploy
Users that are interested in ASR_python_deploy are comparing it to the libraries listed below.
- A PyTorch-based LSTM Punctuation Restoration Implementation/A Simple Tutorial for Learning PyTorch and NLP☆24Jan 11, 2021Updated 5 years ago
- using microphone☆17Sep 2, 2021Updated 4 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆138Jun 10, 2022Updated 3 years ago
- A short-utterance online speech recognition service based on WeNet☆11Feb 25, 2023Updated 3 years ago
- HippoRAG implementation using APIs☆14Jun 6, 2024Updated last year
- ☆19Jan 5, 2020Updated 6 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Recipe for LibriPhrase☆37Sep 2, 2023Updated 2 years ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)☆32Jul 21, 2021Updated 4 years ago
- PyTorch implementation of LiMuSE☆32Oct 11, 2022Updated 3 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆5,098Mar 31, 2026Updated last month
- ☆33Aug 6, 2021Updated 4 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- Segment Anything Model (SAM) interactive demo with OpenVINO☆13Jun 5, 2024Updated last year
- A tool for calculating WER (Word Error Rate) in python.☆14Sep 18, 2024Updated last year
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13May 6, 2023Updated 3 years ago
- A vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- ☆170Nov 28, 2024Updated last year
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆14Mar 22, 2023Updated 3 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- Rainbow Keywords - Official PyTorch Implementation☆14Jun 27, 2024Updated last year
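- A Baidu Hanyu crawler that scrapes information on every Chinese character in the Unicode character set and every word formed from those characters, including pinyin, definitions, encyclopedia definitions, and English translations☆11Dec 23, 2019Updated 6 years ago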
- Monotonic Alignment Search☆101Jun 9, 2025Updated 10 months ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- Ke-Omni-R is an advanced audio reasoning model and achieved SOTA on MMAU☆60Jun 11, 2025Updated 10 months ago
- ☆10Dec 11, 2021Updated 4 years ago
- Utility to convert Spade device video streams to MJPEG for live viewing in web browsers, VLC, etc.☆11Nov 20, 2023Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- A template for fullstack projects with Vue and FastAPI.☆20Sep 4, 2024Updated last year