swagshaw / ASC-CLLinks
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14Updated 3 years ago
Alternatives and similar repositories for ASC-CL
Users that are interested in ASC-CL are comparing it to the libraries listed below
Sorting:
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- ICSD Dataset☆37Updated 5 months ago
- ☆18Updated 3 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated last year
- ☆17Updated last year
- ☆32Updated 11 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 4 years ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆45Updated 2 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆88Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆41Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Updated 4 years ago
- ☆27Updated 2 years ago
- ☆45Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 3 years ago
- ☆66Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆49Updated last year
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- ☆31Updated 2 years ago
- ☆59Updated last year
- ☆19Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Updated 11 months ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 5 months ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Updated 2 years ago