yfliu87 / DataMining_Capstone
Capstone project of UIUC DataMining course
☆8Updated 9 years ago
Alternatives and similar repositories for DataMining_Capstone:
Users that are interested in DataMining_Capstone are comparing it to the libraries listed below
- ☆9Updated 2 months ago
- A repository of useful scripts for the course CS357 in the form of Jupyter Notebook.☆12Updated 3 years ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆40Updated 4 months ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆13Updated last year
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆12Updated last year
- 【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge☆15Updated last year
- ☆29Updated 8 months ago
- ☆18Updated 8 months ago
- ☆19Updated last year
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆14Updated 5 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆23Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆27Updated this week
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆21Updated last year
- Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"☆25Updated 6 months ago
- ☆29Updated 2 months ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Updated last year
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆16Updated 3 months ago
- ☆12Updated 8 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆36Updated 3 weeks ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆14Updated last month
- ☆25Updated 8 months ago
- ☆45Updated this week
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13Updated last year
- ☆12Updated last year
- Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.☆33Updated last year
- GPT-4V(ision) as A Social Media Analysis Engine☆35Updated 3 months ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆10Updated last year
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆92Updated this week
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆29Updated 4 months ago
- Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, and external …☆13Updated 6 months ago