This task is based on MUSIC-AVQA Dataset. And we focus on optimize the accuracy of AVQA task, which aims to answer questions regarding different visual objects, sounds, and their associations in videos. The problem requires comprehensive multimodal understanding and spatio-temporal reasoning over audio-visual scenes.
☆13Feb 11, 2023Updated 3 years ago
Alternatives and similar repositories for Audio-Visual-Question-Answering-AVQA
Users that are interested in Audio-Visual-Question-Answering-AVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analyzing and Enhancing Visual Learning in LLM-based Radiology Report Generation☆17Feb 23, 2026Updated 2 months ago
- Enhancing Radiology Report Generation via Multi-Phased Supervision☆25Mar 6, 2025Updated last year
- Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation☆19Mar 5, 2025Updated last year
- ☆10Apr 12, 2023Updated 3 years ago
- My blog based on the Jekyll theme Chirpy☆19May 21, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆21Mar 2, 2022Updated 4 years ago
- Performance comparison of three Bron–Kerbosch algorithm implementations that find all maximal cliques in a graph.☆25May 12, 2014Updated 11 years ago
- VOCAL-UDF: Self-Enhancing Video Data Management System for Compositional Events with Large Language Models☆12Dec 12, 2025Updated 4 months ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆35Dec 12, 2019Updated 6 years ago
- Generative AI Customer Service Chatbot with MongoDB Atlas and Google Cloud Vertex AI PaLM API☆16Dec 11, 2023Updated 2 years ago
- A small collection of tools to manage deep learning with multiple sources of loss☆17May 6, 2025Updated 11 months ago
- Intrusion Detection System, IDS,Cyberattack Detection,Pytorch,Transformer☆11Oct 17, 2022Updated 3 years ago
- View low level information about NFC tags and their contents, and write your own tags with a dynamic NDEF message editor UI. Qt version f…☆22Jul 22, 2013Updated 12 years ago
- ☆26Oct 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Network-Based Malware Detection using Natural Language Processing☆14May 10, 2021Updated 4 years ago
- Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"☆55Nov 11, 2022Updated 3 years ago
- 视频AI科普教程——视频运动检测☆17Oct 13, 2020Updated 5 years ago
- GNN4ID: A Toolset for Crafting Graph Neural Network-Based NIDS Datasets☆31Feb 23, 2026Updated 2 months ago
- ☆17Oct 30, 2018Updated 7 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- Word2Vec embeddings over packet capture data n-grams.☆20Mar 24, 2023Updated 3 years ago
- Sharkticon is an anomaly detection system, it analyzes your network using a Transformers model adapted to the anomaly detection.☆23May 19, 2023Updated 2 years ago
- Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"☆50Nov 10, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Jan 29, 2024Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- ☆23Oct 22, 2024Updated last year
- A relatively simple implementation of the R* Tree data structure for C++☆51Jan 10, 2023Updated 3 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆55Apr 7, 2021Updated 5 years ago
- Face++ starlib 明星库头像标注集爬虫及图片集合,用于face recognition training☆25Sep 29, 2018Updated 7 years ago
- LiPar: A Lightweight Parallel Learning Model for Practical In-Vehicle Network Intrusion Detection (arXiv:2311.08000v2)☆25Nov 22, 2025Updated 5 months ago
- ZVulDrill靶场二次开发,增加了一些常见PHP漏洞,一直在更新。☆32Jun 9, 2017Updated 8 years ago
- Qt library to encode/decode NDEF (NFC Data Exchange Format) messages☆32Sep 28, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆25Jun 29, 2023Updated 2 years ago
- ☆62Jun 17, 2021Updated 4 years ago
- Network data classifier based on the recurrent neural network.☆20Apr 3, 2019Updated 7 years ago
- Implementation of Robust Transformer Based Intrusion Detection, based on the Paper by Wu et. Al☆28Sep 10, 2024Updated last year
- This program allow you to extract some features from pcap files.☆39Apr 4, 2023Updated 3 years ago
- Mirai☆42Oct 19, 2021Updated 4 years ago
- Code for the paper "Anomaly-Based Intrusion Detection in IIoT Networks Using Transformer Models"☆37Mar 3, 2023Updated 3 years ago