DeepSpectrum / DeepSpectrumLite
Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks.
☆17Updated 3 years ago
Alternatives and similar repositories for DeepSpectrumLite:
Users that are interested in DeepSpectrumLite are comparing it to the libraries listed below
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 7 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆22Updated 2 months ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆15Updated this week
- ☆30Updated last year
- ☆20Updated 6 months ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 9 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆41Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 4 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆21Updated last year
- This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"☆10Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- Test Framework for few-shot open set KWS☆31Updated 5 months ago
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆56Updated last year
- Python library for rapid prototyping of environmental sound analysis systems☆42Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆21Updated 10 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated 2 years ago
- Implementation of Phase-aware speech enhancement with deep complex U-Net☆39Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- ☆13Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- ☆11Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- ☆17Updated last week