SRPOL-AUI / spectrum-correction
Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"
☆10Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for spectrum-correction
- ☆18Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆34Updated last month
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆39Updated last year
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- experiments about AudioSet☆43Updated last year
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆13Updated this week
- ☆28Updated 5 months ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆70Updated 3 years ago
- Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICM…☆22Updated 10 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆44Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆39Updated last year
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Updated last year
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆83Updated 2 years ago
- ☆20Updated 10 months ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆29Updated last year
- ☆33Updated 4 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆14Updated 4 years ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 2 weeks ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆42Updated this week
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆64Updated 2 years ago
- ☆27Updated last year
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year