kq-chen / VLMEvalKitLinks
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
☆15Updated 11 months ago
Alternatives and similar repositories for VLMEvalKit
Users that are interested in VLMEvalKit are comparing it to the libraries listed below
Sorting:
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆105Updated 8 months ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆25Updated 2 years ago
- ☆114Updated last month
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆92Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆120Updated 4 months ago
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 8 months ago
- ☆179Updated 2 months ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆84Updated 3 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆121Updated 8 months ago
- ☆39Updated 6 months ago
- ☆96Updated last year
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆66Updated 9 months ago
- ☆51Updated last year
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆208Updated 3 weeks ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated last year
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆62Updated 2 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- ☆182Updated 9 months ago
- ☆87Updated 2 years ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Updated 2 years ago
- ☆111Updated 7 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆95Updated 2 months ago
- [ICLR 2025] A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆90Updated last week
- ☆27Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Updated last year
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆67Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆188Updated last year
- ☆75Updated last year