Stevetich / EventHallusion
EventHallusion: Diagnosing Event Hallucinations in Video LLMs
☆24Updated last month
Related projects ⓘ
Alternatives and complementary repositories for EventHallusion
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆22Updated last month
- ☆19Updated 3 weeks ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆15Updated 4 months ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability☆35Updated 2 weeks ago
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆29Updated last month
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆16Updated last month
- FreeVA: Offline MLLM as Training-Free Video Assistant☆49Updated 5 months ago
- ☆76Updated 3 weeks ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆42Updated 4 months ago
- The official implementation of RAR☆75Updated 7 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆66Updated last year
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆33Updated 2 weeks ago
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Updated 7 months ago
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆34Updated 6 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆30Updated this week
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆21Updated 4 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆29Updated last month
- Official repository for CoMM Dataset☆24Updated 2 months ago
- ☆11Updated 4 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆19Updated last month
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆64Updated last month
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.☆66Updated 3 months ago
- ☆54Updated 4 months ago
- Turning to Video for Transcript Sorting☆46Updated last year
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆41Updated 4 months ago
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- ☆36Updated 7 months ago
- Instruction Tuning in Continual Learning paradigm☆26Updated 4 months ago