JiuFengSC / ElasticAST
Official code of ElasticAST (Interspeech 2024 paper)
☆34Updated last year
Alternatives and similar repositories for ElasticAST
Users who are interested in ElasticAST are comparing it to the repositories listed below.
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆52Updated last month
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆65Updated 5 months ago
- ☆102Updated 6 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆41Updated 3 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆44Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆39Updated 6 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆39Updated 4 months ago
- A toolkit dedicated to speech evaluation.☆24Updated last year
- ☆109Updated 2 months ago
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement☆72Updated 3 months ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34Updated last year
- [ICASSP 2024] Official code for FreGrad☆36Updated last year
- ☆55Updated 11 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆43Updated 7 months ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Updated last week
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆75Updated 5 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆81Updated 3 months ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"☆52Updated 2 weeks ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆97Updated 3 months ago
- Prediction of sound event bounding boxes (SEBBs)☆31Updated last year
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆129Updated 2 weeks ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Sound Event Detection (SED) paper collection☆15Updated last year
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆26Updated last week
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- ☆25Updated last year
- ☆101Updated last year
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated this week
- Official repository for FlowSE (Interspeech 2025)☆64Updated 4 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Updated last year