voxos-ai/streaming-whisper-server

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voxos-ai/streaming-whisper-server)

voxos-ai / streaming-whisper-server

A streaming whisper server for on-prem transcription

☆23

Alternatives and similar repositories for streaming-whisper-server

Users that are interested in streaming-whisper-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
jzshq208886 / wenet_asr
View on GitHub
☆12Jul 11, 2024Updated 2 years ago
Bartelds / ctc-dro
View on GitHub
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17May 16, 2025Updated last year
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
luweigen / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆121Jan 29, 2024Updated 2 years ago
YanZiBuGuiCHunShiWan / RESTFUL_ASR
View on GitHub
基于wenet的短时在线语音识别服务
☆11Feb 25, 2023Updated 3 years ago
IS2AI / MultilingualASR
View on GitHub
☆14Aug 9, 2021Updated 4 years ago
Gelelmaster / Funasr-Qwen-GPTSovits
View on GitHub
<综合> Funasr语音识别，调用Qwen大模型回答，通过GPTSovits输出语音的ai程序，其中调用模型还是在线，后续将添加离线大模型
☆13Nov 30, 2024Updated last year
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
kaihuhuang / Language-Group
View on GitHub
☆11Dec 24, 2024Updated last year
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
ysngki / XMoE
View on GitHub
☆15Oct 19, 2024Updated last year
ANonEntity / WhisperWithVAD
View on GitHub
Whisper combined with Silero VAD, for improved long-form transcriptions
☆55Dec 11, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MorenoLaQuatra / vad
View on GitHub
Simple voice activity detection (VAD) algorithm in Python
☆15Aug 10, 2023Updated 2 years ago
Navaneeth-Sharma / Akshara-Jaana
View on GitHub
A OCR Project for Reading New and Old Kannada Texts
☆10Aug 31, 2024Updated last year
lendle / OnlineLearning.jl
View on GitHub
☆14Dec 9, 2014Updated 11 years ago
huyhoang17 / Semantic_Search
View on GitHub
[DEPRECATED] Baseline Project for Semantic Searching
☆10Oct 15, 2018Updated 7 years ago
trangptm / Column_networks
View on GitHub
Column Networks for Collective Classification: A novel deep learning model for collective classification in multi-relational domains
☆12Nov 22, 2016Updated 9 years ago
PeoplePlusAI / Sthaan
View on GitHub
Sthaan uses AI to create digital addresses with local language support in voice/text, making it easier for people to find and reach locat…
☆12Nov 17, 2024Updated last year
cart / godot-inputsharp
View on GitHub
A C# abstraction on top of Godot's input events that makes life just a little bit easier
☆10Feb 1, 2018Updated 8 years ago
iakashpaul / Portal
View on GitHub
Android app for the Hole in your Palm project, making LLMs accessible on-device!
☆19May 3, 2024Updated 2 years ago
osome-iu / Botometer101
View on GitHub
This repository contains the code for the paper "Botometer 101: Social bot practicum for computational social scientists."
☆11Oct 6, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10Nov 5, 2020Updated 5 years ago
SoonSYJ / fawasr
View on GitHub
FunASR安卓端侧离线版本2pass全模式
☆15Sep 4, 2023Updated 2 years ago
lukeewin / faster_whisper_streaming
View on GitHub
This is a project focused on Faster Whisper, a streaming speech recognition project.
☆18Sep 27, 2024Updated last year
gokulkarthik / text2speech
View on GitHub
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
☆57May 7, 2023Updated 3 years ago
tuanio / conformer-rnnt
View on GitHub
Conformer RNN-Transducer
☆14May 25, 2022Updated 4 years ago
DongKeon / webrtc-whisper-asr
View on GitHub
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆13Sep 27, 2024Updated last year
protonx-tf-06-projects / lora-experiment-1
View on GitHub
Use LoRA technique to improve training Large Language Model
☆13Jul 25, 2023Updated 3 years ago
zhoutuan / mod_funasr
View on GitHub
FreeSWITCH ASR module fork from mod_audio_stream， use FunASR online cpu version
☆20Jun 27, 2025Updated last year
QuadraV-Speech / funasr_seaco_paraformer_onnx_with_timestamp
View on GitHub
修复funasr中seaco-paraformer导出onnx后没有时间戳的bug
☆25Sep 12, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Full-Stack-Data-Science / real-time-ml-inference-with-spark-streaming-and-kafka
View on GitHub
FSDS Webinar 1: Real-Time Machine Learning Inference with Spark Streaming and Kafka
☆10Feb 17, 2025Updated last year
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
miclast / FreePBX-Call-intrusion
View on GitHub
Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call c…
☆16Jul 9, 2021Updated 5 years ago
frankyoujian / Edge-Punct-Casing
View on GitHub
☆33Feb 4, 2025Updated last year
AshwaniRajput87 / Full_stack_may_2023
View on GitHub
☆10Sep 10, 2023Updated 2 years ago
vonage-garage-rip / AnsweringMachineDetection
View on GitHub
☆15Dec 8, 2022Updated 3 years ago
fengredrum / finetune-whisper-lora
View on GitHub
Fine-Tune Whisper with Transformers and PEFT
☆58Nov 4, 2023Updated 2 years ago