lucadellalib/ts-asr

Target speaker automatic speech recognition (TS-ASR)

☆14

Alternatives and similar repositories for ts-asr

Users who are interested in ts-asr are comparing it to the libraries listed below.

tzyll / ChineseHP
Dataset for "Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models" (Interspeech 2024).
☆16 · Jul 4, 2024 · Updated 2 years ago
echocatzh / Demo-of-DeepComplexAEC
☆11 · Jun 15, 2022 · Updated 4 years ago
hhhaaahhhaa / ASR-TTA
☆16 · Nov 4, 2025 · Updated 8 months ago
jwr1995 / PubSep
Repository of published DNN speech separation recipes for a number of datasets
☆13 · Jan 22, 2024 · Updated 2 years ago
MorenoLaQuatra / vad
Simple voice activity detection (VAD) algorithm in Python
☆15 · Aug 10, 2023 · Updated 2 years ago
backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper "DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model"
☆13 · Mar 30, 2025 · Updated last year
dodohow1011 / TS-VAD
☆55 · Jan 15, 2021 · Updated 5 years ago
asteroid-team / Libri_VAD
Script to generate VAD dataset used in Asteroid recipe
☆21 · Sep 30, 2021 · Updated 4 years ago
VKW2021 / kaldi-baseline
Kaldi CNN-TDNNF baseline
☆13 · Aug 31, 2021 · Updated 4 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
deepvk / muse
🎵 muse: Music Separation
☆11 · Feb 14, 2024 · Updated 2 years ago
BUTSpeechFIT / hystoc
Getting confidences from any end-to-end system
☆11 · May 24, 2023 · Updated 3 years ago
introlab / uimvdr
☆13 · Oct 11, 2024 · Updated last year
corticph / error-align
Text-to-text alignment algorithm for speech recognition error analysis.
☆32 · Jun 23, 2026 · Updated last month
ancher-bohdan / stm32_usb_interface
Example of using different features of the USB interface on STM32
☆20 · Apr 9, 2023 · Updated 3 years ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
yichen14 / FastAdaSP
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models" (EMNLP'24, Oral)
☆17 · Nov 14, 2024 · Updated last year
isadrtdinov / kws-attention
Attention-based model for keyword spotting
☆19 · Aug 9, 2021 · Updated 4 years ago
itsnotacie / AAAI-26_SepPrune
SepPrune: Structured Pruning for Efficient Deep Speech Separation (AAAI'26)
☆15 · May 31, 2025 · Updated last year
rithiksachdev / PostASR-Correction-SLT2024
☆18 · Jul 22, 2024 · Updated 2 years ago
crb02005 / gimp-segment-anything
A wrapper for Meta's Segment Anything for GIMP
☆18 · Aug 28, 2023 · Updated 2 years ago
semanticVAD / testsets
Testing sets for semanticVAD
☆20 · Feb 18, 2025 · Updated last year
echocatzh / py-aec-unified2021
☆47 · Jun 6, 2021 · Updated 5 years ago
cadia-lvl / samromur-asr
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12 · Sep 30, 2022 · Updated 3 years ago
hmohebbi / disentangling_representations
☆14 · Oct 3, 2025 · Updated 9 months ago
nttcslab / dcase2023_task2_evaluator
☆12 · Aug 10, 2023 · Updated 2 years ago
Mo-yun / DSDPRNN
Implementation of Dual-Stream DPRNN (paper: Nonlinear Residual Echo Suppression Based on Dual-Stream DPRNN)
☆21 · Jul 15, 2021 · Updated 5 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆45 · Nov 18, 2022 · Updated 3 years ago
juice500ml / xlm_to_xlsr
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12 · Mar 12, 2024 · Updated 2 years ago
lmiguelgato / DAP_project
Multiple DOA estimation & delay-and-sum beamforming
☆21 · Oct 13, 2020 · Updated 5 years ago
DeepSpectrum / DeepSpectrumLite
Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…
☆18 · Apr 16, 2022 · Updated 4 years ago
popcornell / MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22 · Apr 8, 2021 · Updated 5 years ago
RapidAI / RapidPunc
A library for adding punctuation to ASR output text.
☆19 · May 8, 2023 · Updated 3 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
AMAAI-Lab / DART
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16 · Apr 22, 2026 · Updated 3 months ago
alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆53 · Jul 17, 2023 · Updated 3 years ago
Den4ikAI / ruphon
Simple IPA phonemizer based on ruaccent-encoder
☆24 · Apr 15, 2025 · Updated last year
Mu-Y / DiariST
☆18 · Sep 19, 2023 · Updated 2 years ago
echocatzh / GFTNN
Gated Convolutional F-T-LSTM Neural Network
☆40 · Jun 15, 2022 · Updated 4 years ago