V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction
☆27Feb 4, 2026Updated last month
Alternatives and similar repositories for V2P-Bench
Users that are interested in V2P-Bench are comparing it to the libraries listed below
Sorting:
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆35Jul 15, 2025Updated 7 months ago
- [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection☆10Jan 24, 2025Updated last year
- ☆50Jan 13, 2026Updated last month
- ☆47Apr 9, 2025Updated 11 months ago
- Multi-agent AI research system — finds academic papers via semantic search & citation snowballing, then answers questions over them using…☆79Feb 28, 2026Updated last week
- ☆10Jun 30, 2025Updated 8 months ago
- The reinforcement learning codes for dataset SPA-VL☆45Jun 24, 2024Updated last year
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆73Apr 25, 2024Updated last year
- DataCompare is a Java-based tool designed to verify the consistency of data after replication or migration operations are completed betwe…☆169Mar 2, 2026Updated last week
- [ICLR 2025] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection☆22Sep 16, 2025Updated 5 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆57Nov 5, 2025Updated 4 months ago
- ☆21Updated this week
- [ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆20Aug 30, 2024Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- [KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation☆33Nov 18, 2025Updated 3 months ago
- IEEE Transactions on Affective Computing, 2025☆24Jun 6, 2025Updated 9 months ago
- ☆58Feb 27, 2026Updated last week
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆203Sep 26, 2024Updated last year
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆25Jan 31, 2025Updated last year
- IEEE Transactions on Affective Computing, 2022☆28Dec 2, 2023Updated 2 years ago
- 本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。☆45Mar 26, 2024Updated last year
- ☆132Mar 22, 2025Updated 11 months ago
- ☆55Feb 14, 2026Updated 3 weeks ago
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆345Feb 13, 2026Updated 3 weeks ago
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆46Sep 21, 2025Updated 5 months ago
- ☆59Dec 10, 2025Updated 2 months ago
- HuggingChat Python API,make the 'stream' params work☆21Dec 26, 2023Updated 2 years ago
- ☆32Jul 29, 2024Updated last year
- 基于LLM与意图识别的高级多轮问答Agent,采用Cornucopia-LLM与意图识别技术,结合参数提取及slot词槽技术,实现高效的多轮问答交互,具有Function Call、代码解释器、RAG 等功能☆38Apr 26, 2024Updated last year
- This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"☆64Dec 29, 2025Updated 2 months ago
- ⚛ My self website built with react.js☆26Feb 22, 2024Updated 2 years ago
- Official Implementation of "CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion"☆122Feb 28, 2026Updated last week
- ☆55Jun 4, 2025Updated 9 months ago
- 泡面的密码工具箱☆75Dec 21, 2025Updated 2 months ago
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations☆36Oct 17, 2024Updated last year
- A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifacts☆31Updated this week
- momentum.js offers a toolkit for procedural design and task automation within a user-friendly WYSIWYG interface.☆43Updated this week
- 826专业课复习笔记,供考研贵系、清深、网研院的uu们参考☆55Apr 18, 2025Updated 10 months ago
- The Agentic chaoschain is an innovative framework powered by AI Agents that includes governance, consensus, proposals, and dispute resolu…☆42Mar 18, 2025Updated 11 months ago