swagshaw / ASC-CLLinks
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14Updated 3 years ago
Alternatives and similar repositories for ASC-CL
Users that are interested in ASC-CL are comparing it to the libraries listed below
Sorting:
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆12Updated 10 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- ICSD Dataset☆36Updated 4 months ago
- ☆17Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆69Updated 3 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 4 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 3 years ago
- ☆27Updated last year
- ☆66Updated last year
- ☆18Updated 3 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆88Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆24Updated last year
- ☆31Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆12Updated 11 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- ☆32Updated 11 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 3 years ago
- ☆58Updated 2 years ago
- ☆59Updated last year
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year