Synthetic Video Hallucination and Mitigation
☆18 · Sep 21, 2025 · Updated 6 months ago
Alternatives and similar repositories for VideoHallu
Users that are interested in VideoHallu are comparing it to the libraries listed below.
- An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluatio… ☆61 · Jul 18, 2025 · Updated 8 months ago
- ☆26 · Nov 8, 2024 · Updated last year
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models. ☆42 · Updated this week
- ☆16 · Dec 8, 2024 · Updated last year
- Cellular automata traffic simulation ☆11 · Jan 18, 2021 · Updated 5 years ago
- ☆17 · Oct 26, 2021 · Updated 4 years ago
- ☆22 · May 4, 2025 · Updated 10 months ago
- KDD21 Attentive Heterogeneous Graph Embedding for Job Mobility Prediction ☆13 · Sep 11, 2022 · Updated 3 years ago
- Heterogeneous Multi-agent Version of Highway-env ☆18 · Jun 28, 2023 · Updated 2 years ago
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su… ☆17 · Nov 4, 2025 · Updated 4 months ago
- AutoHallusion Codebase (EMNLP 2024) ☆22 · Dec 6, 2024 · Updated last year
- ☆32 · Feb 8, 2024 · Updated 2 years ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation". ☆24 · Oct 22, 2025 · Updated 5 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding ☆26 · Oct 16, 2023 · Updated 2 years ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025 ☆30 · Apr 8, 2025 · Updated 11 months ago
- Materials for the course: Data Science for Mechanical System ☆34 · Nov 18, 2025 · Updated 4 months ago
- Collect real-time transit data and process it into a retroactive GTFS 'schedule' which can be used for routing/analysis ☆58 · Sep 23, 2025 · Updated 6 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering ☆54 · Jul 14, 2025 · Updated 8 months ago
- Python package to process NGSIM data and traffic sensing with autonomous vehicles ☆58 · Jun 15, 2020 · Updated 5 years ago
- SepMark: Deep Separable Watermarking for Unified Source Tracing and Deepfake Detection ☆65 · Mar 8, 2024 · Updated 2 years ago
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024 ☆67 · Aug 10, 2024 · Updated last year
- Unity character outfit-changing system solution ☆117 · Sep 27, 2022 · Updated 3 years ago
- Real-time NYC subway data parsing for humans ☆120 · Oct 27, 2025 · Updated 4 months ago
- Simulations of Traffic System Based on the Theory of Cellular Automaton ☆112 · May 18, 2017 · Updated 8 years ago
- Code of the paper: A Recipe for Watermarking Diffusion Models ☆152 · Nov 13, 2024 · Updated last year
- A Unified Database of NYC transport (subway, taxi/Uber, and citibike) data. ☆178 · Jul 13, 2017 · Updated 8 years ago
- An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs. ☆256 · Feb 4, 2025 · Updated last year
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning ☆296 · Mar 13, 2024 · Updated 2 years ago
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu… ☆355 · Jun 18, 2023 · Updated 2 years ago
- Animating scheduled transit trips using the Transitland API and Processing ☆289 · Jan 22, 2020 · Updated 6 years ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(… ☆335 · Oct 14, 2025 · Updated 5 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆367 · Sep 6, 2024 · Updated last year
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model ☆373 · Jun 23, 2024 · Updated last year
- [ICCV 2023] Official implementation of the paper: "DIRE for Diffusion-Generated Image Detection" ☆387 · Sep 26, 2024 · Updated last year
- A curated list of trustworthy deep learning papers. Daily updating... ☆383 · Mar 13, 2026 · Updated last week
- Eagle: Frontier Vision-Language Models with Data-Centric Strategies ☆934 · Oct 25, 2025 · Updated 4 months ago
- A method to increase the speed and lower the memory footprint of existing vision transformers. ☆1,174 · Jun 17, 2024 · Updated last year
- Concept Sliders for Precise Control of Diffusion Models ☆1,131 · Jun 20, 2025 · Updated 9 months ago
- Existing Literature about Machine Unlearning ☆954 · Aug 29, 2025 · Updated 6 months ago