YUCHEN005/DPSL-ASR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YUCHEN005/DPSL-ASR)

YUCHEN005 / DPSL-ASR

Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"

☆44

Alternatives and similar repositories for DPSL-ASR

Users that are interested in DPSL-ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YUCHEN005 / RATS-Channel-A-Speech-Data
View on GitHub
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16Oct 22, 2022Updated 3 years ago
YUCHEN005 / UNA-GAN
View on GitHub
Code for paper "Unsupervised Noise adaptation using Data Simulation"
☆14May 16, 2024Updated 2 years ago
YUCHEN005 / Gradient-Remedy
View on GitHub
Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"
☆21May 24, 2023Updated 3 years ago
YUCHEN005 / GILA
View on GitHub
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
☆18Jun 21, 2023Updated 3 years ago
YUCHEN005 / MIR-GAN
View on GitHub
Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recogni…
☆16Jun 21, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
YUCHEN005 / Unified-Enhance-Separation
View on GitHub
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45Jul 10, 2024Updated 2 years ago
YUCHEN005 / UniVPM
View on GitHub
Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"
☆28Jun 21, 2023Updated 3 years ago
archiki / Robust-E2E-ASR
View on GitHub
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆49Dec 25, 2024Updated last year
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
YUCHEN005 / NASE
View on GitHub
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
☆89Jun 10, 2024Updated 2 years ago
Hypotheses-Paradise / Hypo2Trans
View on GitHub
Single-blind supplementary materials for NeurIPS 2023 submission
☆94Oct 30, 2024Updated last year
LeonWlw / asr_blockformer
View on GitHub
E2E ASR system
☆14Oct 20, 2022Updated 3 years ago
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
YUCHEN005 / RobustGER
View on GitHub
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
☆143May 8, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
shikiw / Modality-Integration-Rate
View on GitHub
[ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…
☆113Jul 9, 2025Updated last year
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
View on GitHub
☆33Aug 10, 2022Updated 3 years ago
lvrysis / Audio-DNN-Classification
View on GitHub
Deep Neural Networks for audio classification
☆10Apr 11, 2024Updated 2 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
View on GitHub
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Jan 28, 2019Updated 7 years ago
Xianchao-Wu / wenet-deep-sparse-conformer
View on GitHub
☆15Aug 25, 2022Updated 3 years ago
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
swagshaw / Rainbow-Keywords
View on GitHub
Rainbow Keywords - Official PyTorch Implementation
☆14Jun 27, 2024Updated 2 years ago
YUCHEN005 / STAR-Adapt
View on GitHub
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
☆241May 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pengzhendong / welm
View on GitHub
One command to build TLG.fst for WeNet.
☆30Oct 11, 2022Updated 3 years ago
NKU-HLT / KNN-CTC
View on GitHub
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42Mar 20, 2024Updated 2 years ago
jctian98 / e2e_lfmmi
View on GitHub
E2E system with LF-MMI; word N-gram for Mandarin
☆167Apr 29, 2022Updated 4 years ago
Audio-WestlakeU / UMA-ASR
View on GitHub
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆35Dec 17, 2024Updated last year
YUCHEN005 / GenTranslate
View on GitHub
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
☆199Jul 22, 2024Updated 2 years ago
mct10 / CoBERT
View on GitHub
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48Nov 8, 2023Updated 2 years ago
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
slub / docsa
View on GitHub
SLUB Document Classification and Similarity Analysis
☆10Aug 31, 2023Updated 2 years ago
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
DanielLin94144 / Test-time-adaptation-ASR-SUTA
View on GitHub
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23Apr 1, 2022Updated 4 years ago
russellgeum / Temporal-Convolution-Resnet
View on GitHub
[Not Official] Implementation of TC-Resnet, INTERSPEECH 2019
☆22Jan 24, 2024Updated 2 years ago
swagshaw / TorchKWS
View on GitHub
Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.
☆41Apr 5, 2024Updated 2 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
KrishnaDN / Keyword-Transformer
View on GitHub
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23May 19, 2021Updated 5 years ago