benluks/streaming-asr

Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.

☆18

Alternatives and similar repositories for streaming-asr

Users who are interested in streaming-asr are comparing it to the libraries listed below.

Chaanks / stklia
Simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices (@Chaanks & @vbrignatz).
☆10 · Oct 10, 2021 · Updated 4 years ago
Adel-Moumen / fast_sligru
☆12 · Mar 24, 2024 · Updated 2 years ago
Pliploop / SemiSupCon
Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.
☆17 · Jul 24, 2024 · Updated 2 years ago
ETH-DISCO / blap
Official repo for BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning presented at ICASSP 2025
☆16 · Nov 18, 2024 · Updated last year
vvolhejn / thesis
ETH Zürich MSc Thesis: Accelerating Neural Audio Synthesis
☆26 · Apr 10, 2023 · Updated 3 years ago
Thytu / SMIT
SMIT: A Simple Modality Integration Tool
☆15 · Mar 31, 2024 · Updated 2 years ago
lyramakesmusic / sa3-inpainter-ui
SA3 medium audio inpainter — MLX SAME-L decoder + FastAPI + vanilla Svelte UI
☆20 · May 26, 2026 · Updated 2 months ago
dogancan / expected-edit-distance
Expected edit distance implementation using OpenFst tools
☆11 · May 13, 2015 · Updated 11 years ago
karchkha / MSG-LD
Official repository for: Simultaneous Music Separation and Generation Using Multi-Track Latent Diffusion Models
☆19 · Nov 21, 2025 · Updated 8 months ago
xjuspeech / YOLOPitch
☆10 · Jun 11, 2024 · Updated 2 years ago
carlosabalde / mobiledetect2vcl
Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…
☆14 · Nov 13, 2023 · Updated 2 years ago
TEAMuP-dev / pyharp
Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs…
☆28 · Updated this week
MTG / playlists-stat-analysis
Tools for Analyzing Popularity and Semantic Diversity of a Playlist Dataset
☆10 · Jun 17, 2024 · Updated 2 years ago
geoffroypeeters / ssmnet_ISMIR2023
☆20 · Oct 20, 2023 · Updated 2 years ago
iamcam / ai-wordpress-rag-demo
This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retrieval-Augmented Generation) lan…
☆11 · Apr 2, 2024 · Updated 2 years ago
ashispati / dmelodies_dataset
Dataset to study disentanglement in the context of symbolic music. Published as an ISMIR'20 paper titled: "dMelodies: A Music Dataset for…
☆28 · Oct 21, 2020 · Updated 5 years ago
dusty-phillips / similar-sounding-words
A list of similar sounding words to help disambiguate voice coding
☆11 · May 20, 2020 · Updated 6 years ago
vackva / Orbe
Binaural Spatializer Audio Plugin
☆24 · Jun 25, 2024 · Updated 2 years ago
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 5 months ago
wiragotama / TIARA-annotationTool
An Interactive Tool for Annotating Discourse Structure and Text Improvement
☆16 · Sep 15, 2021 · Updated 4 years ago
mpourmpoulis / PythonVoiceCodingPlugin
Sublime Text 3 plugin for voice coding Python 3
☆13 · Sep 15, 2022 · Updated 3 years ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
manoskary / weavemuse
An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and in…
☆32 · Feb 6, 2026 · Updated 5 months ago
manoskary / SMUG-Explain
A Framework for Symbolic MUsic Graph Explanations
☆11 · Jul 30, 2025 · Updated 11 months ago
kyegomez / AudioMamba
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15 · Updated this week
kiang / map.coa.gov.tw
working with data from map.coa.gov.tw
☆15 · Feb 26, 2018 · Updated 8 years ago
webpolis / musai
Machine learning-powered music generation. Full-featured tokenizer, customization options, and high-quality output files.
☆15 · Feb 3, 2025 · Updated last year
brunovollmer / opencv_label_tool
Video labeling tool based on OpenCV. Easily customizable.
☆12 · Aug 30, 2024 · Updated last year
ivmm / LLStack
LLStack - a one-stop, high-performance PHP website solution and one-click installation package based on LiteSpeed
☆19 · Jan 14, 2022 · Updated 4 years ago
yongyizang / music-source-restoration
Official Repository for "Music Source Restoration"
☆31 · Jun 1, 2025 · Updated last year
jorshi / sieve
Audio plugin for the automatic classification and intelligent browsing of kick and snare drum sounds
☆11 · Feb 22, 2021 · Updated 5 years ago
LiDCC / MERTech
Official code of ICASSP 2024 paper "MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Tas…
☆11 · Jun 14, 2024 · Updated 2 years ago
saebyulpark / MCIC
Code and Dataset for <Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases, ISMIR 2024>
☆15 · Nov 12, 2024 · Updated last year
wolstena / varnish-bad-bot-detection
A Varnish 4.0 subroutine for blocking bad bots (from http://omninoggin.com/web-development/block-unwanted-spam-bots-using-varnish-vcl/)
☆12 · Jan 30, 2015 · Updated 11 years ago
SonyCSLParis / cae-invar
Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder
☆38 · Dec 16, 2024 · Updated last year
yuanxun-yx / SplitCAD
Proposal for a GUI-first CAD system with source/result separation — like KiCad, but for 3D mechanical design
☆17 · Sep 5, 2025 · Updated 10 months ago
daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
☆17 · Nov 25, 2024 · Updated last year
WildHoneyPie / BEAST
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…
☆44 · Sep 11, 2024 · Updated last year
caspark / factorio-a11y
An accessibility mod which implements voice control for Factorio
☆14 · Oct 4, 2021 · Updated 4 years ago