tuanio / conformer-rnntLinks

Conformer RNN-Transducer

☆14

Alternatives and similar repositories for conformer-rnnt

Users that are interested in conformer-rnnt are comparing it to the libraries listed below

Sorting:

xi-j / Mamba-ASR
ConMamba for Automatic Speech Recognition
☆78Updated 11 months ago
sinhat98 / adapter-wavlm
☆43Updated 2 years ago
idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆20Updated last year
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆38Updated 2 months ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆29Updated 2 years ago
dmlguq456 / NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
☆73Updated 9 months ago
vocaliodmiku / wav2vec2mdd-Text
☆18Updated 3 years ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆43Updated last year
tango4j / llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆14Updated last year
MingLunHan / CIF-HieraDist
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
☆40Updated last year
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 6 months ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆28Updated last year
Miamoto / Conformer-NTM
☆15Updated last year
Diamondfan / Child-ASR-Paper
A list of papers for child ASR
☆43Updated 9 months ago
SpeechColab / GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆161Updated 3 weeks ago
aizhiqi-work / MM-KWS
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆33Updated 2 months ago
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆28Updated 7 months ago
tomasJwYU / AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
☆31Updated last year
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆107Updated 3 years ago
ncsoft / PhonMatchNet
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆50Updated last year
microsoft / NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆54Updated 5 months ago
Srijith-rkr / KAUST-Whisper-Adapter
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆36Updated last year
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆41Updated 2 years ago
NaoyukiKanda / LibriSpeechMix
☆35Updated 4 years ago
Liangzheng-ZL / BEdit-TTS
Speech samples and code of BEdit-TTS
☆33Updated last year
X-LANCE / KWStreamingSearch
☆64Updated 3 weeks ago
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆94Updated 8 months ago
WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆77Updated last year
MrSupW / ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆52Updated last year