☆37Jan 25, 2024Updated 2 years ago
Alternatives and similar repositories for AutoEval-Video
Users that are interested in AutoEval-Video are comparing it to the libraries listed below
Sorting:
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated 2 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- ☆28Nov 10, 2025Updated 3 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57May 28, 2025Updated 9 months ago
- [ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark☆137Jul 9, 2025Updated 7 months ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Apr 9, 2024Updated last year
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated 9 months ago
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Apr 4, 2024Updated last year
- ☆10Feb 10, 2026Updated 3 weeks ago
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆86Sep 12, 2024Updated last year
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆251Aug 21, 2025Updated 6 months ago
- LED : Light Enhanced Depth Estimation at Night☆13Dec 9, 2025Updated 2 months ago
- ☆10Nov 15, 2023Updated 2 years ago
- ☆10Sep 5, 2024Updated last year
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 11 months ago
- P1AC: Revisiting Absolute Pose From a Single Affine Correspondence☆11Mar 19, 2024Updated last year
- ☆12Jun 26, 2024Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆10Updated this week
- RAMPA: Robotic Augmented Reality for Machine Programming by Demonstration https://arxiv.org/abs/2410.13412☆16Oct 6, 2025Updated 5 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆126Sep 28, 2025Updated 5 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆108Aug 21, 2025Updated 6 months ago
- ☆10Apr 17, 2024Updated last year
- An Android Application for GLCC☆11Sep 30, 2022Updated 3 years ago
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- The Code Café is a low key, relaxed series of meetups where we explore various technologies, tools, programming aids and other fun stuff …☆15Dec 22, 2022Updated 3 years ago
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings".☆12Nov 1, 2021Updated 4 years ago
- ☆10May 12, 2018Updated 7 years ago
- ☆12Oct 24, 2023Updated 2 years ago
- ☆18Dec 8, 2024Updated last year
- ☆10Aug 29, 2023Updated 2 years ago
- Framework to run the Federated Test-Time-Adaptation method StarAlign☆11May 17, 2024Updated last year
- ☆13Jan 13, 2025Updated last year