DeepSpectrum / DeepSpectrumLite
Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks.
☆17Updated 2 years ago
Alternatives and similar repositories for DeepSpectrumLite:
Users that are interested in DeepSpectrumLite are comparing it to the libraries listed below
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- ☆30Updated last year
- ☆25Updated 3 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆14Updated last year
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 6 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 3 months ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆13Updated 4 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- ☆30Updated 8 months ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- ☆10Updated 3 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated 11 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- Adapting a ConvNeXt model to audio classification on AudioSet☆22Updated last month
- This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"☆10Updated last year
- Da - ECHO - RetrievAl - daTasEt☆26Updated 8 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆66Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- ☆62Updated 6 months ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated last year
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- ☆13Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year