This task is based on the MUSIC-AVQA dataset. We focus on optimizing the accuracy of the AVQA task, which aims to answer questions about different visual objects, sounds, and their associations in videos. The problem requires comprehensive multimodal understanding and spatio-temporal reasoning over audio-visual scenes.
☆13Feb 11, 2023Updated 3 years ago
Alternatives and similar repositories for Audio-Visual-Question-Answering-AVQA
Users that are interested in Audio-Visual-Question-Answering-AVQA are comparing it to the libraries listed below.
- Analyzing and Enhancing Visual Learning in LLM-based Radiology Report Generation☆17Feb 23, 2026Updated last month
- Enhancing Radiology Report Generation via Multi-Phased Supervision☆24Mar 6, 2025Updated last year
- Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation☆19Mar 5, 2025Updated last year
- ☆10Apr 12, 2023Updated 2 years ago
- My blog based on the Jekyll theme Chirpy☆18May 21, 2025Updated 10 months ago
- ☆21Mar 2, 2022Updated 4 years ago
- Performance comparison of three Bron–Kerbosch algorithm implementations that find all maximal cliques in a graph.☆25May 12, 2014Updated 11 years ago
- VOCAL-UDF: Self-Enhancing Video Data Management System for Compositional Events with Large Language Models☆12Dec 12, 2025Updated 3 months ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆34Dec 12, 2019Updated 6 years ago
- Generative AI Customer Service Chatbot with MongoDB Atlas and Google Cloud Vertex AI PaLM API☆16Dec 11, 2023Updated 2 years ago
- A small collection of tools to manage deep learning with multiple sources of loss☆17May 6, 2025Updated 11 months ago
- Intrusion Detection System, IDS, Cyberattack Detection, PyTorch, Transformer☆11Oct 17, 2022Updated 3 years ago
- View low level information about NFC tags and their contents, and write your own tags with a dynamic NDEF message editor UI. Qt version f…☆22Jul 22, 2013Updated 12 years ago
- ☆26Oct 13, 2023Updated 2 years ago
- Network-Based Malware Detection using Natural Language Processing☆14May 10, 2021Updated 4 years ago
- Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"☆54Nov 11, 2022Updated 3 years ago
- Video AI introductory tutorial: video motion detection☆17Oct 13, 2020Updated 5 years ago
- GNN4ID: A Toolset for Crafting Graph Neural Network-Based NIDS Datasets☆29Feb 23, 2026Updated last month
- ☆17Oct 30, 2018Updated 7 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- Word2Vec embeddings over packet capture data n-grams.☆20Mar 24, 2023Updated 3 years ago
- Sharkticon is an anomaly detection system that analyzes your network using a Transformers model adapted to anomaly detection.☆23May 19, 2023Updated 2 years ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆50Nov 10, 2022Updated 3 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Jan 29, 2024Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- ☆23Oct 22, 2024Updated last year
- A relatively simple implementation of the R* Tree data structure for C++☆51Jan 10, 2023Updated 3 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆54Apr 7, 2021Updated 5 years ago
- Crawler and image collection for the Face++ starlib celebrity portrait annotation set, for face recognition training☆25Sep 29, 2018Updated 7 years ago
- LiPar: A Lightweight Parallel Learning Model for Practical In-Vehicle Network Intrusion Detection (arXiv:2311.08000v2)☆25Nov 22, 2025Updated 4 months ago
- Secondary development of the ZVulDrill vulnerability practice range, adding some common PHP vulnerabilities; continuously updated.☆32Jun 9, 2017Updated 8 years ago
- Qt library to encode/decode NDEF (NFC Data Exchange Format) messages☆32Sep 28, 2020Updated 5 years ago
- ☆25Jun 29, 2023Updated 2 years ago
- ☆62Jun 17, 2021Updated 4 years ago
- Network data classifier based on the recurrent neural network.☆20Apr 3, 2019Updated 7 years ago
- Implementation of Robust Transformer Based Intrusion Detection, based on the paper by Wu et al.☆27Sep 10, 2024Updated last year
- This program allows you to extract some features from pcap files.☆39Apr 4, 2023Updated 3 years ago
- Mirai☆42Oct 19, 2021Updated 4 years ago
- Code for the paper "Anomaly-Based Intrusion Detection in IIoT Networks Using Transformer Models"☆36Mar 3, 2023Updated 3 years ago