Ubenwa / cryceleb2023Links
☆12Updated last year
Alternatives and similar repositories for cryceleb2023
Users that are interested in cryceleb2023 are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- ☆94Updated 2 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆110Updated last year
- Machine learning speaker characteristics☆37Updated 2 weeks ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆27Updated 8 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆42Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆150Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated 11 months ago
- EVAR ~ Evaluation package for Audio Representations☆64Updated this week
- ☆92Updated 2 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 3 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆128Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆14Updated last year
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆44Updated 8 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆40Updated 2 weeks ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- This package aims at simplifying the download of the AudioSet dataset.☆54Updated last month
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆86Updated 4 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆48Updated 3 months ago
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆13Updated last year
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆50Updated 2 years ago
- ☆18Updated 4 years ago
- ☆28Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆20Updated 8 months ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆28Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Updated 2 years ago