anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆10 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for specmix-pytorch
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆38 · Updated 2 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆35 · Updated 3 months ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification" ☆31 · Updated 3 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition ☆27 · Updated 4 months ago
- Official repository of NeXt-TDNN for speaker verification ☆56 · Updated last month
- Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 · Updated 2 years ago
- Analysis of XLS-R for Speech Quality Assessment ☆11 · Updated 3 months ago
- Learning differentiable temporal resolution on time-series data. ☆32 · Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆34 · Updated 11 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆31 · Updated 5 months ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in PyTorch. ☆26 · Updated 6 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆21 · Updated 8 months ago
- PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro… ☆18 · Updated 10 months ago
- [ACL 2024] This is the PyTorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing" ☆47 · Updated 2 weeks ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 · Updated last year
- Streaming Audiotransformers for online Audio tagging ☆41 · Updated 4 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆42 · Updated last year
- ConMamba for Automatic Speech Recognition ☆44 · Updated 3 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆32 · Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder ☆45 · Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 · Updated 11 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆44 · Updated last week