apple-yinhan / Noise-robust-SED
☆13 · Jan 2, 2025 · Updated last year
Alternatives and similar repositories for Noise-robust-SED
Users that are interested in Noise-robust-SED are comparing it to the libraries listed below
- ☆22 · Mar 19, 2025 · Updated 10 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection ☆17 · Nov 19, 2024 · Updated last year
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20… ☆15 · Sep 1, 2024 · Updated last year
- ☆13 · Oct 11, 2024 · Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆19 · Dec 1, 2022 · Updated 3 years ago
- Continual Learning Benchmark for Spoken Keyword Spotting ☆17 · Jun 7, 2022 · Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log… ☆16 · Oct 22, 2022 · Updated 3 years ago
- Collection of DCASE related datasets ☆18 · Jun 12, 2025 · Updated 8 months ago
- poorman's ar-dit tts ☆45 · Dec 31, 2025 · Updated last month
- Keras/Pytorch neural network size, operations and parameters counter ☆16 · Mar 23, 2023 · Updated 2 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding ☆23 · Jan 27, 2025 · Updated last year
- ☆32 · Dec 24, 2025 · Updated last month
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- [Not Official] Implementation of TC-Resnet, INTERSPEECH 2019 ☆22 · Jan 24, 2024 · Updated 2 years ago
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift ☆18 · Mar 16, 2024 · Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec" ☆35 · Oct 23, 2025 · Updated 3 months ago
- ☆28 · Oct 17, 2024 · Updated last year
- ☆26 · Nov 2, 2022 · Updated 3 years ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning" ☆33 · May 7, 2025 · Updated 9 months ago
- ☆28 · Mar 14, 2023 · Updated 2 years ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆32 · Mar 8, 2024 · Updated last year
- ☆40 · Jul 15, 2025 · Updated 7 months ago
- MSP-Podcast Challenge Baseline Code ☆30 · Jun 12, 2024 · Updated last year
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931) ☆56 · Dec 23, 2025 · Updated last month
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline ☆195 · Dec 13, 2024 · Updated last year
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis ☆50 · Sep 20, 2025 · Updated 4 months ago
- Da - ECHO - RetrievAl - daTasEt ☆34 · Jul 7, 2024 · Updated last year
- Baselines for IS25 Source Tracing Special Session ☆33 · Jan 3, 2025 · Updated last year
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in … ☆57 · Aug 9, 2025 · Updated 6 months ago
- Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution" (INTERSPEECH 2020) ☆32 · Nov 11, 2020 · Updated 5 years ago
- Prediction of sound event bounding boxes (SEBBs) ☆32 · Aug 2, 2024 · Updated last year
- ☆29 · Jun 15, 2022 · Updated 3 years ago
- ☆37 · Jun 30, 2022 · Updated 3 years ago
- ☆32 · Aug 10, 2022 · Updated 3 years ago
- ☆54 · Nov 14, 2025 · Updated 3 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆40 · Jul 10, 2023 · Updated 2 years ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆36 · Aug 7, 2024 · Updated last year
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'. ☆42 · Jul 23, 2023 · Updated 2 years ago
- ☆41 · May 19, 2023 · Updated 2 years ago