☆32Apr 1, 2023Updated 2 years ago
Alternatives and similar repositories for dcase2023_task7_baseline
Users that are interested in dcase2023_task7_baseline are comparing it to the libraries listed below
Sorting:
- ☆14Sep 20, 2023Updated 2 years ago
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆69Sep 3, 2021Updated 4 years ago
- RWCP-SSD-Onomatopoeia☆23Jun 28, 2023Updated 2 years ago
- ☆12Jun 9, 2025Updated 8 months ago
- ☆50Jun 14, 2022Updated 3 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Localization package using distance and/or angle measurements☆16Mar 11, 2022Updated 3 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- ☆15Mar 30, 2020Updated 5 years ago
- AudioLDM text to audio colab☆19Nov 6, 2023Updated 2 years ago
- Reproduction of "Scyclone" with PyTorch☆16Jan 6, 2021Updated 5 years ago
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆19Apr 10, 2024Updated last year
- Deep neural network for audio super-resolution tasks☆15Sep 6, 2020Updated 5 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆71Feb 10, 2022Updated 4 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Aug 22, 2023Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆23May 9, 2023Updated 2 years ago
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆370Jul 12, 2024Updated last year
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated last year
- A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.☆16Feb 9, 2025Updated last year
- Time-domain Audio Separation Network (IN PYTORCH)☆23Jan 28, 2019Updated 7 years ago
- A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.☆32Dec 17, 2024Updated last year
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated last year
- Unofficial download repository for MusicCaps☆47Apr 21, 2023Updated 2 years ago
- ☆23Apr 25, 2022Updated 3 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆24Jul 4, 2022Updated 3 years ago
- This repository contains the trained models and some audio samples for the tPLCnet.☆28Sep 26, 2023Updated 2 years ago
- ☆27Sep 13, 2021Updated 4 years ago
- ☆25Feb 28, 2023Updated 3 years ago
- Domestic environment sound event detection task☆155Jun 11, 2024Updated last year
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Jun 12, 2025Updated 8 months ago
- Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent"☆61Mar 8, 2023Updated 2 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Aug 20, 2020Updated 5 years ago
- unofficial PyTorch implementation of 《REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET》☆105May 26, 2022Updated 3 years ago
- ☆117Updated this week
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Nov 30, 2020Updated 5 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Jun 24, 2020Updated 5 years ago
- ☆29Jul 4, 2025Updated 8 months ago