felixgontier / dcase-2023-baselineView external linksLinks
☆14Mar 25, 2023Updated 2 years ago
Alternatives and similar repositories for dcase-2023-baseline
Users that are interested in dcase-2023-baseline are comparing it to the libraries listed below
Sorting:
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- Tools for the evaluation of audio captioning.☆18May 23, 2020Updated 5 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆53Sep 20, 2025Updated 4 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆22Dec 17, 2025Updated 2 months ago
- ☆13Dec 12, 2025Updated 2 months ago
- A list of resources that can help in research for automated audio captioning☆34Feb 17, 2021Updated 4 years ago
- ☆31Dec 2, 2020Updated 5 years ago
- Python code for handling the Clotho dataset.☆85Nov 24, 2020Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Configuration Space Exploration Framework☆17Oct 13, 2020Updated 5 years ago
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆10Jun 7, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆49Nov 11, 2025Updated 3 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- Audio captioning baseline system for DCASE 2020 challenge.☆38Aug 22, 2023Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Jan 27, 2021Updated 5 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- Multiscale Score Matching Analysis☆12Jan 19, 2023Updated 3 years ago
- ☆17Nov 7, 2023Updated 2 years ago
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Jul 29, 2021Updated 4 years ago
- Repository of files shared during OpenPlanetary Data Cafés☆11Sep 15, 2022Updated 3 years ago
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…☆10May 9, 2024Updated last year
- Audio captioning recipe☆51Oct 23, 2025Updated 3 months ago
- ☆10Sep 18, 2021Updated 4 years ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆17Mar 21, 2025Updated 10 months ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- Repository for "Condolence and Empathy in Online Communities", EMNLP 2020☆10Nov 9, 2020Updated 5 years ago
- keras implementation of A Discriminative Feature Learning Approach for Deep Face Recognition based on MNIST☆10Mar 1, 2019Updated 6 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆104Mar 29, 2019Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago