Real-time Speech To Text using Faster Whisper.
☆60Aug 12, 2024Updated last year
Alternatives and similar repositories for Real-time-STT
Users that are interested in Real-time-STT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆59Aug 12, 2024Updated last year
- Real-time transcription using faster-whisper☆615Jul 23, 2024Updated last year
- Kamailio in Kubernetes configuration manager☆14May 2, 2019Updated 7 years ago
- ☆14Apr 1, 2025Updated last year
- ☆11Sep 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- ☆13Oct 27, 2025Updated 7 months ago
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Updated this week
- ☆27Nov 3, 2025Updated 6 months ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆359Jul 20, 2025Updated 10 months ago
- ☆21May 11, 2024Updated 2 years ago
- A collection of Korean NLP hands-on labs on Amazon SageMaker☆19Dec 20, 2023Updated 2 years ago
- ☆44Nov 19, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆18Jun 27, 2025Updated 11 months ago
- ☆33Mar 26, 2025Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Sep 6, 2023Updated 2 years ago
- Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of o…☆31May 4, 2026Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Use this library to connect your iOS, WatchOS, or MacOS app to the Vuzix Z100™ smart glasses.☆15Mar 18, 2025Updated last year
- Hpyformer base FunASR☆31Nov 5, 2024Updated last year
- ☆31Apr 5, 2025Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆21May 24, 2024Updated 2 years ago
- Real time transcription with OpenAI Whisper.☆2,938Apr 15, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- This is a middleware which wraps the Asterisk's AMI interface commands into Postgres/SQL functions☆19Jun 11, 2018Updated 7 years ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- WebRTC-HTTP Ingestion Protocol (WHIP) in Rust☆15Dec 17, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- 📞 AGI interface with python for speech recognition☆30Feb 19, 2024Updated 2 years ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- 🐮📢 The first AI voice assistant that interrupts *you*☆148Sep 6, 2024Updated last year
- ☆11Apr 15, 2026Updated last month
- ☆15Oct 19, 2024Updated last year