deegy666 / ADD-RSCLinks
Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’
☆20Updated 4 months ago
Alternatives and similar repositories for ADD-RSC
Users that are interested in ADD-RSC are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- ☆11Updated 2 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆26Updated last month
- ☆11Updated 2 years ago
- SimplifiedTransformer simplifies transformer block without affecting training. Skip connections, projection parameters, sequential sub-bl…☆15Updated 3 weeks ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21Updated 6 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Updated last year
- ☆11Updated last year
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆12Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- ☆18Updated 7 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated 2 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated 2 years ago
- SRTNet☆24Updated 2 years ago
- ESLTTS dataset☆16Updated 10 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Updated last year
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆15Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Updated 9 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆14Updated last year
- Audio-Visual Speech Enhancement Challenge (AVSE) 2024☆11Updated 4 months ago
- speaker-disentangled speech linguistic content quantizer☆24Updated 9 months ago
- PyTorch-based implementations of short-time Fourier transform☆15Updated 5 months ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆28Updated 2 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆43Updated 6 months ago
- ☆13Updated 4 months ago