Temporal Reasoning via Audio Question Answering
☆26Dec 21, 2019Updated 6 years ago
Alternatives and similar repositories for daqa
Users that are interested in daqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆16Aug 17, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year
- A curated list of resources in audio visual question answering and related area. :-)☆17Jun 29, 2025Updated 8 months ago
- Extra notebooks for CCRMA MIR workshop, 2018 edition☆13Jun 28, 2018Updated 7 years ago
- Splits for epic-sounds dataset☆86Aug 2, 2025Updated 7 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆17Mar 2, 2025Updated last year
- MUSIC-AVQA, CVPR2022 (ORAL)☆98Dec 30, 2022Updated 3 years ago
- OntoLearner: A Modular Python Library for Ontology Learning with LLMs https://pypi.org/project/OntoLearner/☆30Mar 11, 2026Updated last week
- NTU EE6483 Group Project☆19Nov 28, 2024Updated last year
- Visualisation of VISOR Segmentations with Annotations and Relations☆22Aug 15, 2022Updated 3 years ago
- A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models☆32Updated this week
- Example code to help people follow along with the tutorials☆22Aug 21, 2024Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- go binary for setting up singularity containers with a miniconda☆19Feb 3, 2026Updated last month
- ☆19Nov 25, 2022Updated 3 years ago
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆69Jul 19, 2025Updated 8 months ago
- skeleton-based action recognition☆19Jan 12, 2022Updated 4 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- ☆11May 5, 2022Updated 3 years ago
- A deep learning enabled music generator module built in Python and using TensorFlow.☆27Jun 18, 2020Updated 5 years ago
- Resources for the Information Service Engineering 2021 lecture, summer semester 2021 at Karlsruhe Institute of Technology (KIT).☆15Apr 17, 2025Updated 11 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 3 months ago
- Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]☆33Feb 27, 2019Updated 7 years ago
- ☆14Sep 22, 2016Updated 9 years ago
- Comparison of auditory DNNs and human brain acitivity.☆18Apr 22, 2025Updated 11 months ago
- Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021☆14Jan 11, 2021Updated 5 years ago
- Deep image generation is becoming a tool to enhance artists and designers creativity potential. In this paper, we aim at making the gener…☆12Aug 18, 2020Updated 5 years ago
- Pytorch implementation of the Double-Step Framework to perform wildfires severity estimation from Sentinel-2 satellite images☆14Nov 10, 2022Updated 3 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- Persian stemmer☆15Jun 18, 2018Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- 浙江工业大学暑假实训,树莓派小车4WD,主题为安保小车,具有红外循迹,自动避障,opencv人脸检测,腾讯API接口实现人脸对比,语音识别功能☆20Jul 11, 2023Updated 2 years ago
- Small python script to download files from a shared dropbox folder in parallel. Script becomes necessary if the folder is too huge to dow…☆32May 4, 2023Updated 2 years ago
- Official codebase for "Online Skeleton-based Action Recognition with Continual Spatio-Temporal Graph Convolutional Networks"☆29Apr 17, 2023Updated 2 years ago
- ☆10Nov 18, 2020Updated 5 years ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".☆11Dec 2, 2024Updated last year