felixgontier/dcase-2023-baseline

☆14

Alternatives and similar repositories for dcase-2023-baseline

Users who are interested in dcase-2023-baseline are comparing it to the repositories listed below.

xieh97 / dcase2023-audio-retrieval
Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge
☆10 · Aug 8, 2023 · Updated 2 years ago
XinhaoMei / DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
☆36 · Nov 22, 2022 · Updated 3 years ago
audio-captioning / caption-evaluation-tools
Tools for the evaluation of audio captioning.
☆19 · May 23, 2020 · Updated 6 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
Labbeti / aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆75 · Mar 22, 2026 · Updated 4 months ago
2019nmid / summer
misson
☆15 · Aug 26, 2019 · Updated 6 years ago
mae-creative-pc / cpac_course_2024-25
☆13 · Dec 12, 2025 · Updated 7 months ago
ExplainableML / ZerAuCap
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
☆19 · Nov 30, 2024 · Updated last year
GATECH-EIC / S3-Router
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17 · Sep 19, 2023 · Updated 2 years ago
ahsanMah / msma
Multiscale Score Matching Analysis
☆11 · Jan 19, 2023 · Updated 3 years ago
aeromamba-super-resolution / aeromamba
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…
☆50 · Nov 11, 2025 · Updated 8 months ago
Labbeti / conette-audio-captioning
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
☆23 · Dec 17, 2025 · Updated 7 months ago
crypto-code / Music-Representation-Comparison
This is the repo with the code to conduct a comparative analysis of different audio representation models.
☆11 · Aug 31, 2023 · Updated 2 years ago
WUT-AI / hypersound
Source code for paper "Hypernetworks build Implicit Neural Representations of Sounds" from ECML 2023
☆13 · Jun 24, 2023 · Updated 3 years ago
wsntxxn / AudioCaption
Audio captioning recipe
☆53 · Oct 23, 2025 · Updated 9 months ago
Beyond-Zw / DLAN-AC
☆11 · Feb 1, 2024 · Updated 2 years ago
audio-captioning / dcase-2020-baseline
Audio captioning baseline system for DCASE 2020 challenge.
☆38 · Aug 22, 2023 · Updated 2 years ago
tcc-power / FRD-unsupervised-video-anomaly-detection
official codes for FRD-UVAD(10 crop version)
☆16 · Nov 2, 2024 · Updated last year
XinhaoMei / ACT
Source code for the paper 'Audio Captioning Transformer'
☆56 · Jan 18, 2022 · Updated 4 years ago
gudgud96 / noisy-student-emotion-training
Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging
☆11 · Dec 2, 2021 · Updated 4 years ago
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41 · Jan 6, 2024 · Updated 2 years ago
sadhusamik / fdlp_spectrogram
☆14 · Nov 28, 2022 · Updated 3 years ago
sh01k / AmplitudeMatching
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple …
☆15 · Mar 30, 2023 · Updated 3 years ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
audio-captioning / audio-captioning-resources
A list of resources that can help in research for automated audio captioning
☆34 · Feb 17, 2021 · Updated 5 years ago
Kota-Dohi / dcase2022_task2_baseline_mobile_net_v2
☆16 · Dec 7, 2022 · Updated 3 years ago
GWLee0524 / AMTL
Asymmetric Multi-Task Learning code, If you want to use it, please let me know and cite AMTL paper
☆11 · Aug 3, 2016 · Updated 9 years ago
d2l-ai / d2l-zh-tensorflow-colab
Automatically Generated d2l-zh TensorFlow Notebooks for Colab
☆12 · Aug 18, 2023 · Updated 2 years ago
nikolakopoulos / Personalized-Diffusions
Personalized Item Exploration Processes for Recommendation
☆15 · Sep 19, 2019 · Updated 6 years ago
audio-captioning / clotho-dataset
Python code for handling the Clotho dataset.
☆85 · Nov 24, 2020 · Updated 5 years ago
DCASE2024-Task7-Sound-Scene-Synthesis / AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
☆14 · Mar 27, 2024 · Updated 2 years ago
raymondxu / java-workshop
Intermediate Java workshop on variables, abstraction, and design patterns ☕
☆10 · Sep 7, 2017 · Updated 8 years ago
biboamy / music-repro
☆17 · Nov 7, 2023 · Updated 2 years ago
raymondxyy / strfnet-IS2020
Official repo for the STRFNet system appeared in INTERSPEECH2020
☆12 · Mar 6, 2021 · Updated 5 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
☆32 · Dec 2, 2020 · Updated 5 years ago
sharathadavanne / seld-dcase2021
Baseline method for sound event localization task of DCASE 2021 challenge
☆45 · Jun 15, 2021 · Updated 5 years ago
Bond1995 / Markov
Code for experiments on transformers using Markovian data.
☆22 · Nov 22, 2024 · Updated last year
lucacoma / NeuralBeamspaceDomainFilter
Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19 · Oct 21, 2022 · Updated 3 years ago
Audio-AGI / dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26 · Mar 27, 2024 · Updated 2 years ago