facebookresearch/daqa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/daqa)

facebookresearch / daqa

Temporal Reasoning via Audio Question Answering

☆27

Alternatives and similar repositories for daqa

Users that are interested in daqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mira-ai-lab / MUSIC-AVQA-R
View on GitHub
☆13May 21, 2024Updated 2 years ago
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
AMAAI-Lab / JamendoMaxCaps
View on GitHub
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53May 24, 2025Updated last year
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
markusaksli / ai-music
View on GitHub
A vanilla Trasformer Decoder music generation model trained on Final Fantasy OST MIDI songs
☆14Jan 14, 2022Updated 4 years ago
rpidanny / ffmpeg-lambda-layer
View on GitHub
AWS Lambda layer with ffmpeg binary.
☆12Jun 18, 2019Updated 7 years ago
terry-yip / speech-to-text
View on GitHub
Speaker diarization and speech to text
☆14Dec 17, 2020Updated 5 years ago
falloutdurham / specaugment
View on GitHub
PyTorch Implementation of Time/Frequency Masks
☆12May 22, 2019Updated 7 years ago
epic-kitchens / VISOR-VIS
View on GitHub
Visualisation of VISOR Segmentations with Annotations and Relations
☆22Aug 15, 2022Updated 3 years ago
partha2409 / DCASE2024_seld_baseline
View on GitHub
☆52Dec 13, 2025Updated 7 months ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
ZhanboShiAI / ENMuS
View on GitHub
[AAAI 2025] Towards Audio-visual Navigation in Noisy Environments: A Large-scale Benchmark Dataset and An Architecture Considering Multip…
☆15May 21, 2026Updated 2 months ago
AlyssaYoung / AVQA
View on GitHub
ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos
☆15Aug 17, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cezannec / ML-microservice-kubernetes
View on GitHub
Deploying a containerized machine learning app, which estimates housing prices, using Docker and Kubernetes (locally).
☆15May 30, 2019Updated 7 years ago
ga642381 / AudioCodec-Hub
View on GitHub
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
☆25Sep 26, 2023Updated 2 years ago
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
habla-liaa / encodecmae
View on GitHub
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
☆101Jul 24, 2024Updated last year
3dlg-hcvc / multion-challenge
View on GitHub
Starter code and instructions for participating in MultiON Challenge 2021.
☆12Jun 12, 2024Updated 2 years ago
galacticglum / composer
View on GitHub
A deep learning enabled music generator module built in Python and using TensorFlow.
☆27Jun 18, 2020Updated 6 years ago
jonschlinkert / fs-exists-sync
View on GitHub
Drop-in replacement for `fs.existsSync` with zero dependencies. Other libs I found either have crucial differences from fs.existsSync, or…
☆12Sep 1, 2017Updated 8 years ago
elianap / divexplorer
View on GitHub
☆11May 5, 2022Updated 4 years ago
breezedeus / LoveShare
View on GitHub
breezedeus的各种分享
☆22Jan 31, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
vscomputer / chuck-examples
View on GitHub
Example code to help people follow along with the tutorials
☆25Aug 21, 2024Updated last year
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 4 months ago
bestjane / pet-shop
View on GitHub
一个基于以太坊的区块链宠物商店
☆11Feb 19, 2018Updated 8 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
daaku / nodejs-makeerror
View on GitHub
A library to make errors.
☆12Oct 23, 2021Updated 4 years ago
jiasenlu / vit-vqgan-jax
View on GitHub
Jax implementation of VIT-VQGAN
☆10Jan 25, 2024Updated 2 years ago
gallipoligiuseppe / TST-CycleGAN
View on GitHub
This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".
☆11Dec 2, 2024Updated last year
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
facebookresearch / pix2vec
View on GitHub
Deep image generation is becoming a tool to enhance artists and designers creativity potential. In this paper, we aim at making the gener…
☆13Aug 18, 2020Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
kevinco27 / attentional-similarity
View on GitHub
Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]
☆33Feb 27, 2019Updated 7 years ago
talbaram3192 / Emotion_Recognition_project
View on GitHub
☆23Feb 27, 2021Updated 5 years ago
ajd12342 / paraspeechcaps
View on GitHub
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆162Mar 26, 2026Updated 3 months ago
SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
zcrabbit / vbpi-gnn
View on GitHub
Code for learnable topological features for phylogenetic inference via graph neural networks
☆10Mar 3, 2023Updated 3 years ago
danelee2601 / Cosine-similarity-classifier
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
wuyi2020 / DoRM
View on GitHub
[NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"
☆13Aug 2, 2024Updated last year