SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be easily built standalone without any depencency.
☆101Dec 14, 2024Updated last year
Alternatives and similar repositories for SummerAsr
Users that are interested in SummerAsr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synt…☆527Jul 10, 2025Updated 9 months ago
- rkllm_talking is a standalone compiled voice communication system based on a large model || rkllm_talking 是一个独立编译的基于大模…☆13Oct 13, 2024Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆548Mar 19, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆605May 15, 2024Updated last year
- Simple VAD (voice activity detection) algorithm written in C☆14Jan 5, 2026Updated 3 months ago
- ☆40Aug 15, 2021Updated 4 years ago
- Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…☆18Oct 21, 2022Updated 3 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- some ncnn demos of FunASR☆28Sep 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32May 10, 2023Updated 2 years ago
- Web chat front end for rk3588_npu_llm_server / RK3588 LLM chat interface☆16Jul 16, 2024Updated last year
- A tiny audio speech (.wav) utility tool (GUI) based on Python2.7+wxPython4.0+PyAudio+Matplotlib+SpeechRecognition(PocketSphinx)+pyttsx3(e…☆18Nov 4, 2019Updated 6 years ago
- mfcc, mel, pcen. (librosa)☆36Nov 20, 2019Updated 6 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆110Aug 16, 2024Updated last year
- Create MP4 videos from JPG/PNG/GIF/BMP images☆14Feb 21, 2015Updated 11 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation☆42Nov 17, 2023Updated 2 years ago
- A simple TTS(text-to-speech) engine for Chinese mandarin☆21Feb 20, 2012Updated 14 years ago
- mnn tts demo.☆19May 7, 2025Updated 11 months ago
- ☆14Jan 31, 2023Updated 3 years ago
- ☆41Oct 8, 2024Updated last year
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆83Sep 1, 2022Updated 3 years ago
- A talking clock in Chinese for esp32 s3 Box with mp3 player and temperature reading☆12May 7, 2023Updated 2 years ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- 数据集自动化制作脚本☆72Mar 26, 2023Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 9 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆132Apr 26, 2023Updated 2 years ago
- Desktop application for neural speech synthesis written in C++☆213Feb 5, 2026Updated 2 months ago