ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆36Apr 22, 2026Updated last month
Alternatives and similar repositories for realtime-whisper
Users that are interested in realtime-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- ☆11May 7, 2022Updated 4 years ago
- ☆12Mar 11, 2025Updated last year
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- search papers of cvpr 2023 by chat gpt☆14Jun 15, 2023Updated 2 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago
- ☆49Nov 26, 2023Updated 2 years ago
- NLP examples(almost Japanese) on AWS☆12May 31, 2022Updated 4 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Hpyformer base FunASR☆31Nov 5, 2024Updated last year
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Dec 24, 2024Updated last year
- ☆13Mar 22, 2018Updated 8 years ago
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- RustPBX go client☆51May 26, 2026Updated last week
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- TensorFlow Lite Model Makerで物体検出を行うハンズオン用資料です(Hands-on for object detection with TensorFlow Lite Model Maker)☆19Dec 3, 2021Updated 4 years ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.☆16Sep 18, 2021Updated 4 years ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆29Feb 27, 2025Updated last year
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆15Oct 19, 2024Updated last year
- Colab notebooks for Next-gen Kaldi☆31Oct 12, 2025Updated 7 months ago
- ☆19Jan 28, 2021Updated 5 years ago
- Modified version of OpenVINO noise_suppression_demo. This version can handle real-time audio stream from microphone and output to headpho…☆16Aug 5, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆31May 11, 2026Updated 3 weeks ago
- 记账应用-前端 http://119.3.214.158/ds-cash/☆11Nov 23, 2022Updated 3 years ago
- paraformer(chinense asr) online onnx runtime for python☆54Mar 27, 2024Updated 2 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆57Apr 21, 2026Updated last month
- ☆13Dec 15, 2022Updated 3 years ago