YUCHEN005/Gradient-Remedy

YUCHEN005 / Gradient-Remedy

Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"

☆21

Alternatives and similar repositories for Gradient-Remedy

Users who are interested in Gradient-Remedy are comparing it to the libraries listed below.

YUCHEN005 / UNA-GAN
Code for paper "Unsupervised Noise adaptation using Data Simulation"
☆14 · May 16, 2024 · Updated 2 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
YUCHEN005 / Unified-Enhance-Separation
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45 · Jul 10, 2024 · Updated 2 years ago
YUCHEN005 / MIR-GAN
Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recogni…
☆16 · Jun 21, 2023 · Updated 3 years ago
YUCHEN005 / GILA
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
☆18 · Jun 21, 2023 · Updated 3 years ago
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆44 · May 23, 2023 · Updated 3 years ago
YUCHEN005 / UniVPM
Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"
☆28 · Jun 21, 2023 · Updated 3 years ago
Hypotheses-Paradise / Hypo2Trans
Single-blind supplementary materials for NeurIPS 2023 submission
☆94 · Oct 30, 2024 · Updated last year
shikiw / Modality-Integration-Rate
[ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…
☆113 · Jul 9, 2025 · Updated last year
YUCHEN005 / RobustGER
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
☆143 · May 8, 2024 · Updated 2 years ago
zqs01 / data2vecnoisy
☆11 · Oct 20, 2022 · Updated 3 years ago
ChangLee0903 / D4AM
☆16 · May 26, 2022 · Updated 4 years ago
YUCHEN005 / STAR-Adapt
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
☆241 · May 24, 2024 · Updated 2 years ago
YUCHEN005 / GenTranslate
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
☆199 · Jul 22, 2024 · Updated 2 years ago
YosukeHiguchi / espnet
End-to-End Speech Processing Toolkit
☆16 · Jan 20, 2025 · Updated last year
echocatzh / GTCNN
Personalized AEC
☆19 · Nov 3, 2022 · Updated 3 years ago
AiTeRLab-GIST / GC_track4_violence_detection_GIST
Grand Challenge 4 track 2 source code developed by GIST
☆13 · Mar 24, 2021 · Updated 5 years ago
shirley-wu / daco
[NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
☆14 · Mar 5, 2025 · Updated last year
Tonyyouyou / Mamba-in-Speech
☆56 · Jul 1, 2024 · Updated 2 years ago
Jiamim / Algorithm_for_Interview-Chinese
Algorithm for Interview (interview algorithm notes, in Chinese)
☆10 · Jul 23, 2018 · Updated 8 years ago
swagshaw / Rainbow-Keywords
Rainbow Keywords - Official PyTorch Implementation
☆14 · Jun 27, 2024 · Updated 2 years ago
eastonYi / Unsupervised-ASR
Unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12 · Oct 22, 2020 · Updated 5 years ago
jin-woo-lee / nfs-binaural
☆13 · Aug 13, 2023 · Updated 2 years ago
XiangzhuKong / CA-Dense-UNet
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13 · Jul 17, 2023 · Updated 3 years ago
intflow / KICT_GC2020_eval500
Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3
☆13 · Mar 4, 2020 · Updated 6 years ago
NikolaiKyhne / MambAttention
Official repository for the paper "MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement" (A…
☆35 · Mar 25, 2026 · Updated 3 months ago
Hypotheses-Paradise / UADF
☆17 · May 5, 2024 · Updated 2 years ago
LiaoEuan / SincAlignNet
This implementation is based on the SincAlignNet model from the paper 'Frequency-Based Alignment of EEG and Audio Signals Using Contrasti…
☆14 · Jul 28, 2025 · Updated 11 months ago
KhanhNguyen4999 / Speech-Enhancement-CLSKD
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
☆11 · Jun 22, 2023 · Updated 3 years ago
Audio-Experience-Design / LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
☆16 · Aug 12, 2025 · Updated 11 months ago
jjery2243542 / semi-supervised-ASR
☆10 · Dec 16, 2018 · Updated 7 years ago
MyParadise21 / Mamba-SEUNet
This is the official implementation of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement
☆95 · May 26, 2025 · Updated last year
huaidanquede / MUSE-Speech-Enhancement
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…
☆58 · Mar 5, 2025 · Updated last year
kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆40 · Feb 21, 2024 · Updated 2 years ago
lordet01 / SE_SNMF_NAT
Speech enhancement (Interspeech 2016, Ideal)
☆19 · Jun 25, 2022 · Updated 4 years ago
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆40 · Mar 24, 2025 · Updated last year
rickgroen / cov-weighting
Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"
☆54 · Jan 11, 2021 · Updated 5 years ago
Audio-WestlakeU / CleanMel
PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94 · Feb 2, 2026 · Updated 5 months ago
dianwen-ng / Keyword-Spotting-ConvMixer
☆33 · Aug 10, 2022 · Updated 3 years ago