☆14 · Mar 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for dcase-2023-baseline
Users who are interested in dcase-2023-baseline are comparing it to the libraries listed below.
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge ☆10 · Aug 8, 2023 · Updated 2 years ago
- Code for CVSSP submission to DCASE 2021 Task 6 ☆36 · Nov 22, 2022 · Updated 3 years ago
- Tools for the evaluation of audio captioning. ☆18 · May 23, 2020 · Updated 5 years ago
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch. ☆69 · Jul 19, 2025 · Updated 7 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆16 · Jun 23, 2024 · Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆17 · Sep 19, 2023 · Updated 2 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference ☆54 · Sep 20, 2025 · Updated 5 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding ☆22 · Dec 17, 2025 · Updated 2 months ago
- ☆13 · Dec 12, 2025 · Updated 2 months ago
- A list of resources that can help in research for automated audio captioning ☆34 · Feb 17, 2021 · Updated 5 years ago
- ☆31 · Dec 2, 2020 · Updated 5 years ago
- Python code for handling the Clotho dataset. ☆85 · Nov 24, 2020 · Updated 5 years ago
- misson ☆15 · Aug 26, 2019 · Updated 6 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆38 · Jan 6, 2024 · Updated 2 years ago
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection. ☆10 · Jun 7, 2022 · Updated 3 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization. ☆12 · Dec 25, 2021 · Updated 4 years ago
- Configuration Space Exploration Framework ☆17 · Oct 13, 2020 · Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆41 · Aug 29, 2024 · Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta… ☆50 · Nov 11, 2025 · Updated 3 months ago
- Audio captioning baseline system for DCASE 2020 challenge. ☆38 · Aug 22, 2023 · Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge ☆42 · Jun 15, 2021 · Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 · Jan 27, 2021 · Updated 5 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention ☆41 · Jul 29, 2021 · Updated 4 years ago
- keras implementation of A Discriminative Feature Learning Approach for Deep Face Recognition based on MNIST ☆10 · Mar 1, 2019 · Updated 7 years ago
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes… ☆10 · May 9, 2024 · Updated last year
- Audio captioning recipe ☆51 · Oct 23, 2025 · Updated 4 months ago
- ☆19 · Jul 22, 2025 · Updated 7 months ago
- Multiscale Score Matching Analysis ☆12 · Jan 19, 2023 · Updated 3 years ago
- Intermediate Java workshop on variables, abstraction, and design patterns ☕ ☆10 · Sep 7, 2017 · Updated 8 years ago
- [JRTIP 2023] Efficient Convolutional Neural Networks on Raspberry Pi for Image Classification ☆10 · Aug 12, 2025 · Updated 6 months ago
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆10 · Dec 14, 2017 · Updated 8 years ago
- Pretrained spoken language classifiers from audio. ☆10 · Jan 21, 2021 · Updated 5 years ago
- ☆10 · Sep 18, 2021 · Updated 4 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- ☆14 · Sep 20, 2023 · Updated 2 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 · Nov 11, 2024 · Updated last year
- ☆17 · Nov 7, 2023 · Updated 2 years ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue… ☆17 · Mar 21, 2025 · Updated 11 months ago