xieh97/dcase2023-audio-retrieval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xieh97/dcase2023-audio-retrieval)

xieh97 / dcase2023-audio-retrieval

Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge

☆10

Alternatives and similar repositories for dcase2023-audio-retrieval

Users that are interested in dcase2023-audio-retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

felixgontier / dcase-2023-baseline
View on GitHub
☆14Mar 25, 2023Updated 3 years ago
XinhaoMei / DCASE2021_task6_v2
View on GitHub
Code for CVSSP submission to DCASE 2021 Task 6
☆36Nov 22, 2022Updated 3 years ago
audio-captioning / caption-evaluation-tools
View on GitHub
Tools for the evaluation of audio captioning.
☆19May 23, 2020Updated 6 years ago
alexjc / nanogpt-speedrun
View on GitHub
NanoGPT (124M) in 5 minutes
☆16Feb 14, 2025Updated last year
gudgud96 / noisy-student-emotion-training
View on GitHub
Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging
☆11Dec 2, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sadhusamik / fdlp_spectrogram
View on GitHub
☆14Nov 28, 2022Updated 3 years ago
d2l-ai / d2l-zh-tensorflow-colab
View on GitHub
Automatically Generated d2l-zh TensorFlow Notebooks for Colab
☆12Aug 18, 2023Updated 2 years ago
raymondxu / java-workshop
View on GitHub
Intermediate Java workshop on variables, abstraction, and design patterns ☕
☆10Sep 7, 2017Updated 8 years ago
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
raymondxyy / strfnet-IS2020
View on GitHub
Official repo for the STRFNet system appeared in INTERSPEECH2020
☆12Mar 6, 2021Updated 5 years ago
zyy-fc / CGMM-MVDR
View on GitHub
☆10Aug 3, 2020Updated 5 years ago
tango4j / llm_speaker_tagging
View on GitHub
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated 2 years ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
popcornell / OSDC
View on GitHub
☆18Jan 26, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pxiangwu / FORB
View on GitHub
"FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track
☆12Jun 20, 2024Updated 2 years ago
fran-martinez / bio_ner_bert
View on GitHub
BERT finetuned on NER downstream tasks
☆15Jun 12, 2023Updated 3 years ago
azer / gezi
View on GitHub
Gezi Web Browser
☆17May 6, 2015Updated 11 years ago
BrainyMachines / fgvc5-cvpr2018-food-fashion
View on GitHub
Fine Grained Visual Categorization
☆11Jun 16, 2018Updated 8 years ago
openplanetary / op-data-cafe
View on GitHub
Repository of files shared during OpenPlanetary Data Cafés
☆11Sep 15, 2022Updated 3 years ago
moomou / ekho
View on GitHub
A simple way to add voice interaction to your site
☆15Sep 4, 2015Updated 10 years ago
3loi / NaturalVoices
View on GitHub
☆61Oct 22, 2025Updated 9 months ago
lifeiteng / NotebookTTS
View on GitHub
Text-To-Speech for NotebookLM
☆39Jul 20, 2025Updated last year
primepake / learnable-speech
View on GitHub
This repo is text to speech with learnable audio encoder without alignment with transcript reference
☆54Sep 20, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Lollipop / Qwen2-Audio
View on GitHub
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
☆40Sep 8, 2024Updated last year
ssprl / Real-time-Blind-source-separation-using-IVA
View on GitHub
☆16Apr 24, 2021Updated 5 years ago
bandhit / doa-gmm-music-fusion
View on GitHub
Sound Angle Estimation by Fusion of Gaussian Mixture Model and Multiple Signal classification
☆12Jan 31, 2019Updated 7 years ago
Mayank-Bhatia / UrbanSound_Classification
View on GitHub
Sound classification using neural networks
☆12Jun 6, 2018Updated 8 years ago
angsaikia / voice-filter
View on GitHub
Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter
☆16Mar 25, 2023Updated 3 years ago
LogicJake / 2020-Xiamen-International-Bank-Financial-Cup
View on GitHub
2020厦门国际银行数创金融杯建模大赛-优胜奖方案
☆11Feb 2, 2021Updated 5 years ago
jperl / angular-jquery-mobile
View on GitHub
A todo example with angular, jquery mobile, and tested with karma & travis. Explanation video here http://youtu.be/hNAHogdcus0
☆16Sep 26, 2018Updated 7 years ago
crtr0 / community-toolkit
View on GitHub
This repo contains information, links & resources related to organizing community events. Contributions wanted!
☆19Sep 21, 2014Updated 11 years ago
luojie1024 / MossQA-mnbvc
View on GitHub
本项目主要对开源的MOSS SFT数据进行整理，转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面，共353w样本，MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数，共630w样本，
☆13Dec 3, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BA-Transform / BAT-Video-Classification
View on GitHub
This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…
☆12Jan 30, 2021Updated 5 years ago
noajshu / scotus-speech
View on GitHub
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22Dec 8, 2022Updated 3 years ago
cmdIrelia / music2dDemo
View on GitHub
MUSIC DOA estimation
☆14Feb 14, 2019Updated 7 years ago
boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
BornInWater / Overlap-Detection
View on GitHub
Overlapped Speech detection in Multi-party Conversations
☆22Feb 20, 2018Updated 8 years ago
audio-captioning / clotho-dataset
View on GitHub
Python code for handling the Clotho dataset.
☆85Nov 24, 2020Updated 5 years ago
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago