JoseponLee / IntentQA
Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.
☆23 · Nov 29, 2024 · Updated last year
Alternatives and similar repositories for IntentQA
Users who are interested in IntentQA are comparing it to the repositories listed below.
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free… ☆29 · Jan 15, 2022 · Updated 4 years ago
- A music composer and player with MATLAB ☆11 · Mar 14, 2020 · Updated 5 years ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model ☆37 · Jan 8, 2025 · Updated last year
- Official implementation of our work: Online Estimating Weight of White Pekin Duck Carcass by Computer Vision ☆35 · Dec 15, 2022 · Updated 3 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight) ☆83 · Jul 1, 2024 · Updated last year
- [CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search ☆54 · Nov 24, 2025 · Updated 2 months ago
- An LLM-powered agent for NetHack ☆17 · Nov 4, 2024 · Updated last year
- ☆11 · Jan 27, 2020 · Updated 6 years ago
- R functions and datasets related to the mapping of text to the United Nations 17 Sustainable Development Goals (SDGs). ☆12 · May 12, 2022 · Updated 3 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training ☆11 · Sep 26, 2023 · Updated 2 years ago
- [ICLR2025] Are Large Vision Language Models Good Game Players? ☆13 · Mar 3, 2025 · Updated 11 months ago
- ☆18 · Dec 3, 2021 · Updated 4 years ago
- Cell2location paper - Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics ☆15 · Nov 26, 2022 · Updated 3 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- Minimal codes for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator [IROS 2024]" ☆15 · Feb 12, 2025 · Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr… ☆34 · Jul 3, 2025 · Updated 7 months ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024) ☆10 · Aug 26, 2024 · Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want ☆14 · Jan 5, 2025 · Updated last year
- NICE challenge 2023 Track2 2nd result (total 4th) (CVPR 2023) sponsored by LG AI/Shutterstock/SNU ☆11 · Jun 22, 2023 · Updated 2 years ago
- ☆16 · Mar 27, 2024 · Updated last year
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation ☆13 · Dec 4, 2023 · Updated 2 years ago
- Homework assignments for the Fall 2020 Pattern Recognition course at the University of Chinese Academy of Sciences (instructors: Liu Chenglin, Xiang Shiming, Zhang Xuyao) ☆10 · Feb 3, 2021 · Updated 5 years ago
- Application and blog explaining my interpretations of In-run Data Shapley ☆24 · Jan 30, 2025 · Updated last year
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". ☆13 · Feb 24, 2025 · Updated 11 months ago
- Operating system simulator based on JavaScript ☆12 · Sep 18, 2020 · Updated 5 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc… ☆16 · Feb 22, 2025 · Updated 11 months ago
- Annotations for the Mistake Detection benchmark of Assembly101 ☆10 · Aug 3, 2023 · Updated 2 years ago
- The source code of ExFunTube ☆10 · Aug 8, 2025 · Updated 6 months ago
- Streaming JSON parser designed to process JSON data incrementally. The primary goal is to handle potentially incomplete JSON data streams… ☆12 · Apr 5, 2025 · Updated 10 months ago
- ☆13 · Mar 31, 2024 · Updated last year
- [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation ☆14 · Mar 4, 2023 · Updated 2 years ago
- Real-time transcription application ☆12 · Jun 9, 2023 · Updated 2 years ago
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023 ☆13 · Aug 3, 2023 · Updated 2 years ago
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue ☆14 · Oct 12, 2021 · Updated 4 years ago
- Example scripts for putEMG electromyographic database processing ☆11 · Dec 28, 2020 · Updated 5 years ago
- ☆14 · Nov 28, 2022 · Updated 3 years ago
- Official Implementation of PixelRNN: In-Pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors ☆15 · Oct 27, 2024 · Updated last year
- Escape room adventure game developed in Unity 3D ☆12 · Apr 28, 2019 · Updated 6 years ago
- ☆21 · Jul 5, 2025 · Updated 7 months ago