microsoft/WavText5K

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/WavText5K)

microsoft / WavText5K

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

☆50

Alternatives and similar repositories for WavText5K

Users that are interested in WavText5K are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FishMaster93 / AFFIA3K
View on GitHub
☆10Apr 12, 2023Updated 3 years ago
akoepke / audio-retrieval-benchmark
View on GitHub
Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022
☆54Jul 16, 2025Updated last year
etzinis / heterogeneous_separation
View on GitHub
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44Dec 6, 2022Updated 3 years ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
XinhaoMei / audio-text_retrieval
View on GitHub
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
☆51May 17, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hqsiswiliam / persona-adaptive-attention
View on GitHub
☆26Oct 13, 2023Updated 2 years ago
audio-captioning / audio-captioning-resources
View on GitHub
A list of resources that can help in research for automated audio captioning
☆34Feb 17, 2021Updated 5 years ago
kyuyeonpooh / objects-that-sound
View on GitHub
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
☆31Jan 29, 2024Updated 2 years ago
Labbeti / aac-datasets
View on GitHub
Audio Captioning datasets for PyTorch.
☆129Mar 25, 2026Updated 3 months ago
swagshaw / ASC-CL
View on GitHub
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14Jul 19, 2022Updated 4 years ago
liuxubo717 / sound_generation
View on GitHub
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69Sep 3, 2021Updated 4 years ago
cfoster0 / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆88Mar 6, 2022Updated 4 years ago
audio-captioning / audio-captioning-papers
View on GitHub
A list of papers about audio captioning
☆78Jul 1, 2022Updated 4 years ago
liuxubo717 / SimPFs
View on GitHub
Code for "Simple Pooling Front-ends for Efficient Audio Calssification", ICASSP 2023
☆57Mar 3, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
XinhaoMei / WavCaps
View on GitHub
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
☆264Jul 25, 2024Updated last year
oncescuandreea / audio-retrieval
View on GitHub
Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch
☆26Aug 18, 2023Updated 2 years ago
haoheliu / torchsubband
View on GitHub
Pytorch implementation of subband decomposition
☆93Jul 26, 2022Updated 3 years ago
liuxubo717 / LASS
View on GitHub
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146Oct 11, 2023Updated 2 years ago
liuxubo717 / V-ACT
View on GitHub
Visually-Aware Audio Captioning
☆43Mar 3, 2023Updated 3 years ago
liuxubo717 / LASS-demopage
View on GitHub
☆19Sep 2, 2022Updated 3 years ago
liuxubo717 / cl4ac
View on GitHub
Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.
☆45Oct 8, 2021Updated 4 years ago
yinkalario / General-Purpose-Sound-Recognition-Demo
View on GitHub
General purpose sound recognition demo
☆161Oct 3, 2023Updated 2 years ago
JinhuaLiang / lam4fsl
View on GitHub
An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"
☆31May 31, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
hche11 / VGGSound
View on GitHub
VGGSound: A Large-scale Audio-Visual Dataset
☆359Sep 13, 2021Updated 4 years ago
XinhaoMei / DCASE2021_task6_v2
View on GitHub
Code for CVSSP submission to DCASE 2021 Task 6
☆36Nov 22, 2022Updated 3 years ago
microsoft / CLAP
View on GitHub
Learning audio concepts from natural language supervision
☆672Sep 18, 2024Updated last year
facebookresearch / AudioMAE
View on GitHub
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆671Apr 5, 2024Updated 2 years ago
RicherMans / AudioCaption
View on GitHub
Dataset and baseline for the first Audiocaption task
☆79Jul 25, 2024Updated last year
LAION-AI / audio-dataset
View on GitHub
Audio Dataset for training CLAP and other models
☆747Jan 8, 2026Updated 6 months ago
nttcslab / dcase2023_task2_evaluator
View on GitHub
☆12Aug 10, 2023Updated 2 years ago
wsntxxn / TextToAudioGrounding
View on GitHub
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49Oct 23, 2025Updated 8 months ago
keetsky / Net_ghostVLAD-pytorch
View on GitHub
☆21Jul 11, 2019Updated 7 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tqbl / arca23k-dataset
View on GitHub
The code used to create the ARCA23K and ARCA23K-FSD datasets
☆16Nov 9, 2021Updated 4 years ago
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
sony / CLIPSep
View on GitHub
☆43Feb 21, 2023Updated 3 years ago
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
cdjkim / audiocaps
View on GitHub
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
☆215Oct 6, 2025Updated 9 months ago
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
microsoft / AudioEntailment
View on GitHub
Audio Entailment: Deductive Reasoning for Audio Understanding
☆17Dec 10, 2024Updated last year