This task is based on the MUSIC-AVQA dataset. We focus on optimizing the accuracy of the AVQA task, which aims to answer questions regarding different visual objects, sounds, and their associations in videos. The problem requires comprehensive multimodal understanding and spatio-temporal reasoning over audio-visual scenes.
☆13Feb 11, 2023Updated 3 years ago
Alternatives and similar repositories for Audio-Visual-Question-Answering-AVQA
Users that are interested in Audio-Visual-Question-Answering-AVQA are comparing it to the libraries listed below.
- Analyzing and Enhancing Visual Learning in LLM-based Radiology Report Generation☆17Feb 23, 2026Updated 3 months ago
- Enhancing Radiology Report Generation via Multi-Phased Supervision☆25Mar 6, 2025Updated last year
- Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation☆19Mar 5, 2025Updated last year
- Official implementation of "LaCo: Layer-wise Compensation for Pruned Large Language Models" (ACL 2026).☆152May 20, 2026Updated 3 weeks ago
- ☆10Apr 12, 2023Updated 3 years ago
- My blog based on the Jekyll theme Chirpy☆20Updated this week
- ☆21Mar 2, 2022Updated 4 years ago
- Performance comparison of three Bron–Kerbosch algorithm implementations that find all maximal cliques in a graph.☆25May 12, 2014Updated 12 years ago
- VOCAL-UDF: Self-Enhancing Video Data Management System for Compositional Events with Large Language Models☆12Dec 12, 2025Updated 6 months ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆34Dec 12, 2019Updated 6 years ago
- Generative AI Customer Service Chatbot with MongoDB Atlas and Google Cloud Vertex AI PaLM API☆16Dec 11, 2023Updated 2 years ago
- A small collection of tools to manage deep learning with multiple sources of loss☆18May 6, 2025Updated last year
- Intrusion Detection System, IDS, Cyberattack Detection, Pytorch, Transformer☆11Oct 17, 2022Updated 3 years ago
- View low level information about NFC tags and their contents, and write your own tags with a dynamic NDEF message editor UI. Qt version f…☆22Jul 22, 2013Updated 12 years ago
- ☆26Oct 13, 2023Updated 2 years ago
- Network-Based Malware Detection using Natural Language Processing☆14May 10, 2021Updated 5 years ago
- Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"☆55Nov 11, 2022Updated 3 years ago
- Video AI popular-science tutorial: video motion detection☆17Oct 13, 2020Updated 5 years ago
- GNN4ID: A Toolset for Crafting Graph Neural Network-Based NIDS Datasets☆32Feb 23, 2026Updated 3 months ago
- ☆17Oct 30, 2018Updated 7 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆44Dec 6, 2022Updated 3 years ago
- Word2Vec embeddings over packet capture data n-grams.☆20Mar 24, 2023Updated 3 years ago
- Sharkticon is an anomaly detection system that analyzes your network using a Transformers model adapted to anomaly detection.☆23May 19, 2023Updated 3 years ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆50Nov 10, 2022Updated 3 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Jan 29, 2024Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- ☆23Oct 22, 2024Updated last year
- A relatively simple implementation of the R* Tree data structure for C++☆51Jan 10, 2023Updated 3 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆54Apr 7, 2021Updated 5 years ago
- Crawler and image collection for the Face++ starlib celebrity avatar annotation set, for face recognition training☆25Sep 29, 2018Updated 7 years ago
- LiPar: A Lightweight Parallel Learning Model for Practical In-Vehicle Network Intrusion Detection (arXiv:2311.08000v2)☆26Nov 22, 2025Updated 6 months ago
- Secondary development of the ZVulDrill vulnerability practice range, adding some common PHP vulnerabilities; continuously updated.☆31Jun 9, 2017Updated 9 years ago
- Qt library to encode/decode NDEF (NFC Data Exchange Format) messages☆31Sep 28, 2020Updated 5 years ago
- ☆25Jun 29, 2023Updated 2 years ago
- ☆62Jun 17, 2021Updated 4 years ago
- Network data classifier based on the recurrent neural network.☆20Apr 3, 2019Updated 7 years ago
- Implementation of Robust Transformer Based Intrusion Detection, based on the paper by Wu et al.☆29Sep 10, 2024Updated last year
- Mirai☆42Oct 19, 2021Updated 4 years ago
- Code for the paper "Anomaly-Based Intrusion Detection in IIoT Networks Using Transformer Models"☆38Mar 3, 2023Updated 3 years ago