Real-time Speech To Text using Faster Whisper.
☆61Aug 12, 2024Updated last year
Alternatives and similar repositories for Real-time-STT
Users that are interested in Real-time-STT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆60Aug 12, 2024Updated last year
- Kamailio in Kubernetes configuration manager☆14May 2, 2019Updated 7 years ago
- ☆14Apr 1, 2025Updated last year
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆12Jun 23, 2024Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,643Nov 12, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Sep 8, 2024Updated last year
- It summerizes the algorithms of Machine Learning.☆12Oct 26, 2025Updated 8 months ago
- State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: …☆21Apr 16, 2025Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- openvino version of openai/whisper☆15Jun 19, 2026Updated last week
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Jun 12, 2026Updated 2 weeks ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆360Jul 20, 2025Updated 11 months ago
- ☆21May 11, 2024Updated 2 years ago
- ☆16May 14, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Open Source Android App for remote control of model cars/boats etc. based on an ESP8266/ESP32 with a camera.☆16Jan 12, 2025Updated last year
- Video Surveillance SDK☆11Mar 2, 2022Updated 4 years ago
- Compute WER and SER for speech recognition evaluation☆26Jun 6, 2026Updated 3 weeks ago
- A tutorial for runing LLM in Andriod Termux with Vulkan GPU acceleration☆17Jan 27, 2025Updated last year
- ☆33Mar 26, 2025Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Sep 6, 2023Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Hpyformer base FunASR☆31Nov 5, 2024Updated last year
- ☆11Dec 24, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆20May 24, 2024Updated 2 years ago
- Real time transcription with OpenAI Whisper.☆2,937Apr 15, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Jun 16, 2026Updated last week
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- ODAS: Open embeddeD Audition System☆11Mar 20, 2021Updated 5 years ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- 📞 AGI interface with python for speech recognition☆30Feb 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- ☆18May 15, 2025Updated last year
- STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지☆72Jun 18, 2025Updated last year
- Telegram bot to interact with ollama models☆18Mar 30, 2025Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 10 months ago
- A terminal chatbot, powered by Groq Cloud API (Windows / macOS / Linux / Android / iOS)☆17Mar 6, 2025Updated last year
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago