anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆11 Updated 2 years ago
Alternatives and similar repositories for specmix-pytorch:
Users who are interested in specmix-pytorch are comparing it to the libraries listed below.
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆37 Updated 5 months ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification" ☆31 Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆38 Updated 4 months ago
- Learning differentiable temporal resolution on time-series data. ☆35 Updated 2 years ago
- Official PyTorch implementation of Continual Learning For On-Device Environmental Sound Classification ☆14 Updated 2 years ago
- Streaming Audio Transformers for online audio tagging ☆43 Updated 7 months ago
- ☆62 Updated 4 months ago
- ☆19 Updated last year
- ☆14 Updated last year
- ☆64 Updated last year
- Official repository of NeXt-TDNN for speaker verification ☆65 Updated 3 months ago
- ☆44 Updated last year
- This package aims to simplify downloading the AudioSet dataset. ☆45 Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder ☆45 Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆22 Updated 10 months ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆35 Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated last year
- ☆11 Updated last year
- Code and datasets for our ICASSP 2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆43 Updated last year
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆33 Updated 7 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆96 Updated 10 months ago
- ☆51 Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆65 Updated 2 years ago
- TODO ☆37 Updated last year
- ☆53 Updated 11 months ago
- ConMamba for Automatic Speech Recognition ☆53 Updated 5 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition ☆32 Updated 6 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning ☆22 Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆25 Updated 4 months ago