ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆36Apr 22, 2026Updated 2 weeks ago
Alternatives and similar repositories for realtime-whisper
Users that are interested in realtime-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆25Feb 12, 2026Updated 2 months ago
- ☆11May 7, 2022Updated 4 years ago
- ☆12Mar 11, 2025Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago
- ☆49Nov 26, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 10 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- ☆11Dec 24, 2024Updated last year
- ☆12Jul 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Aug 9, 2021Updated 4 years ago
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- ☆16Nov 9, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- ☆15Oct 19, 2024Updated last year
- ☆56Apr 21, 2026Updated 2 weeks ago
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆18Sep 27, 2024Updated last year
- A streaming whisper server for on-prem transcription☆23Aug 15, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆69Apr 27, 2026Updated last week
- Engineered a robust deep learning model using Convolutional Neural Networks and TensorFlow to classify 114 bird species based on audio re…☆21Jul 18, 2024Updated last year
- [CVPR2019] Synthesizing Environment-Aware Activities via Activity Sketches☆13Oct 3, 2023Updated 2 years ago
- 基于ChatGLM2带的openai_api.py修改支持ChatGLM3。☆19Oct 31, 2023Updated 2 years ago
- API to load and query documents using RAG☆14Sep 25, 2023Updated 2 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- A Docker image with Llama Index, Lang Chain, and a few other popular AI packages installed by default☆11Nov 19, 2025Updated 5 months ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆51Oct 12, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆31Feb 4, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- Informed Rapidly-exploring Random Tree-Star with C# Programming☆10Nov 6, 2021Updated 4 years ago
- PaddleOCR Winform/WPF Demo☆10May 10, 2022Updated 3 years ago
- paraformer web server build with sanic☆28May 3, 2023Updated 3 years ago
- PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architecture☆26Jun 12, 2023Updated 2 years ago
- ☆44Jan 20, 2025Updated last year