Temporal Reasoning via Audio Question Answering
☆26Dec 21, 2019Updated 6 years ago
Alternatives and similar repositories for daqa
Users who are interested in daqa are comparing it to the repositories listed below
- The code reproduces the results of the experiments in the paper. In particular, it performs experiments in which machine-learning models …☆20Aug 16, 2021Updated 4 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆27May 30, 2025Updated 9 months ago
- Bench4KE, a benchmarking system for knowledge engineering automation tasks.☆14Jan 26, 2026Updated last month
- Apparel Classification for Indian Ethnic Clothes☆12Feb 10, 2023Updated 3 years ago
- Deep image generation is becoming a tool to enhance artists' and designers' creative potential. In this paper, we aim at making the gener…☆12Aug 18, 2020Updated 5 years ago
- MUSIC-AVQA, CVPR2022 (ORAL)☆96Dec 30, 2022Updated 3 years ago
- Safe and easy programming for you☆10Sep 1, 2022Updated 3 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks☆46May 24, 2025Updated 9 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- sbt plugin to detect Akka module mismatches and fail build☆10Sep 15, 2025Updated 5 months ago
- ☆13Oct 4, 2021Updated 4 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆11Dec 19, 2025Updated 2 months ago
- You don't need an SSH private key for an EC2 instance, FOREVER☆10Jun 28, 2021Updated 4 years ago
- [NeurIPS 2023] Official PyTorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaptation"☆13Aug 2, 2024Updated last year
- YOLOv3 implemented with TensorFlow 2.0☆14Jan 12, 2023Updated 3 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- JAX implementation of ViT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆16Mar 2, 2025Updated last year
- A library to make errors.☆12Oct 23, 2021Updated 4 years ago
- 🥅 Capture errors from `defer`'d cleanup functions. Reliably!☆12Nov 21, 2025Updated 3 months ago
- Lane segmentation model trained with a TensorFlow implementation of a MobileNetV2-based U-Net☆11Mar 24, 2023Updated 2 years ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".☆10Dec 2, 2024Updated last year
- ☆11May 5, 2022Updated 3 years ago
- ☆13May 21, 2024Updated last year
- Dead simple and effective fixture preparation for testing in Go☆11Dec 11, 2024Updated last year
- ☆13May 31, 2023Updated 2 years ago
- An open-source, in-memory graph database for social networks☆10Sep 20, 2022Updated 3 years ago
- Color scheme optimizer for terminal☆11Apr 9, 2022Updated 3 years ago
- OCaml parsers for multiple key formats☆15Aug 1, 2024Updated last year
- License plate recognition based on a semantic segmentation approach using U-Net☆13Dec 5, 2019Updated 6 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month
- ACM MM 2022 paper "AVQA: A Dataset for Audio-Visual Question Answering on Videos"☆16Aug 17, 2023Updated 2 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆54Jul 16, 2025Updated 7 months ago
- ☆13Mar 25, 2021Updated 4 years ago
- The official training/validation/test dataset repository for the SOTA? task as SimpleText Task4@CLEF2024☆15Jul 7, 2024Updated last year
- CLI: Delete GitHub Branches by pattern matching.☆16Aug 23, 2022Updated 3 years ago
- ☆13Dec 26, 2019Updated 6 years ago