Temporal Reasoning via Audio Question Answering
☆27Dec 21, 2019Updated 6 years ago
Alternatives and similar repositories for daqa
Users that are interested in daqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆28Jun 10, 2026Updated 3 weeks ago
- ☆13May 21, 2024Updated 2 years ago
- A curated list of resources in audio visual question answering and related area. :-)☆17Jun 29, 2025Updated last year
- A makeshift python program which relies on nltk and Stanford Core NLP models to expand common contractions in the english language.☆10Nov 8, 2017Updated 8 years ago
- Splits for epic-sounds dataset☆86Aug 2, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jan 5, 2022Updated 4 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆52May 24, 2025Updated last year
- ☆13Sep 4, 2023Updated 2 years ago
- AWS Lambda layer with ffmpeg binary.☆12Jun 18, 2019Updated 7 years ago
- Speaker diarization and speech to text☆14Dec 17, 2020Updated 5 years ago
- Data generator for stereo sound event localization and detection task of DCASE 2025 challenge☆17Jul 17, 2025Updated 11 months ago
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆22Mar 2, 2025Updated last year
- ☆75Aug 7, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Starter code and instructions for participating in MultiON Challenge 2021.☆12Jun 12, 2024Updated 2 years ago
- go binary for setting up singularity containers with a miniconda☆21Feb 3, 2026Updated 4 months ago
- ☆19Nov 25, 2022Updated 3 years ago
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆73Mar 22, 2026Updated 3 months ago
- Deploying a containerized machine learning app, which estimates housing prices, using Docker and Kubernetes (locally).☆15May 30, 2019Updated 7 years ago
- Materiales del taller de Meta Aprendizaje de la RIIAA 2020.☆13Aug 26, 2020Updated 5 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- Fairlex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing☆16Jul 25, 2023Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11May 5, 2022Updated 4 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 5 months ago
- ☆27May 27, 2025Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 6 months ago
- ☆14Sep 22, 2016Updated 9 years ago
- Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]☆33Feb 27, 2019Updated 7 years ago
- ☆19Sep 5, 2024Updated last year
- Comparison of auditory DNNs and human brain acitivity.☆19Apr 22, 2025Updated last year
- Deep image generation is becoming a tool to enhance artists and designers creativity potential. In this paper, we aim at making the gener…☆13Aug 18, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pytorch implementation of the Double-Step Framework to perform wildfires severity estimation from Sentinel-2 satellite images☆14Nov 10, 2022Updated 3 years ago
- Code for learnable topological features for phylogenetic inference via graph neural networks☆10Mar 3, 2023Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆11May 8, 2018Updated 8 years ago
- Persian stemmer☆15Jun 18, 2018Updated 8 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"☆13Aug 2, 2024Updated last year
- ☆16Sep 4, 2019Updated 6 years ago
- Convert a username/group name to a uid/gid number☆18Oct 8, 2015Updated 10 years ago