si0wang/ViCrit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/si0wang/ViCrit)

si0wang / ViCrit

☆24

Alternatives and similar repositories for ViCrit

Users that are interested in ViCrit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
OoDBag / VisTA
View on GitHub
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆27May 31, 2025Updated last year
umd-huang-lab / Mementos
View on GitHub
☆32Feb 8, 2024Updated 2 years ago
NVlabs / PerVLBenchmark
View on GitHub
☆11Jul 31, 2022Updated 3 years ago
InternScience / MME-Reasoning
View on GitHub
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45Jun 17, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
agents-x-project / TIR-Bench
View on GitHub
[ECCV 2026] Official implementation of "TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning"
☆25Feb 8, 2026Updated 5 months ago
real-absolute-AI / NoisyRollout
View on GitHub
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆112Sep 18, 2025Updated 10 months ago
showlab / VisInContext
View on GitHub
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
☆28Oct 30, 2024Updated last year
tianyi-lab / HallusionBench
View on GitHub
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆342Oct 14, 2025Updated 9 months ago
patrick-tssn / VSTAR
View on GitHub
[ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information
☆16Oct 27, 2024Updated last year
LisaAnne / Hallucination
View on GitHub
☆97Mar 29, 2019Updated 7 years ago
allenai / aokvqa
View on GitHub
Official repository for the A-OKVQA dataset
☆117May 8, 2024Updated 2 years ago
open-vision-language / infoseek
View on GitHub
☆78Oct 27, 2023Updated 2 years ago
ayiyayi / EgoExoBench
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
HJYao00 / MMReason
View on GitHub
[ICCV 2025] MMReason, MLLMs, step by step, reasoning benchmark, AGI
☆15Apr 25, 2026Updated 2 months ago
HDETR / H-PETR-Pose
View on GitHub
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14Sep 1, 2022Updated 3 years ago
CSU-JPG / Chart2Code
View on GitHub
[ACL-main-2026]We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Mode…
☆28Jan 27, 2026Updated 5 months ago
kushalkafle / PReFIL
View on GitHub
Code for the WACV 2020 paper "Answering Questions about Data Visualizations using Efficient Bimodal Fusion"
☆14Jun 22, 2021Updated 5 years ago
sugar-fly / VSFormer
View on GitHub
[AAAI 2024] VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
☆16Apr 7, 2024Updated 2 years ago
YujieLu10 / Seeker
View on GitHub
☆11May 24, 2024Updated 2 years ago
InternRobotics / EgoThinker
View on GitHub
Official implementation of EgoThinker at NIPS 2025
☆29Nov 25, 2025Updated 7 months ago
GXYM / VCapsBench
View on GitHub
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆20Jun 2, 2025Updated last year
apple / ml-mia-bench
View on GitHub
This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
☆38Mar 9, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
CSfufu / Revisual-R1
View on GitHub
[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, mul…
☆212Dec 10, 2025Updated 7 months ago
agneay / pygame-projects
View on GitHub
A curated list of all awesome pygames created by Agneay B Nair
☆10Apr 28, 2024Updated 2 years ago
zhaohengyuan1 / SCT
View on GitHub
(IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"
☆13Mar 20, 2025Updated last year
junyangwang0410 / AMBER
View on GitHub
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
☆172Jan 15, 2024Updated 2 years ago
khoiucd / escape-tgt
View on GitHub
Official Implementation for "ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation", CVPR 2024.
☆10Jun 17, 2024Updated 2 years ago
RAIVNLab / VideoNet
View on GitHub
CVPR '26 Highlight
☆25May 6, 2026Updated 2 months ago
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆68Jul 16, 2024Updated 2 years ago
jayusxp / UECA-Prompt
View on GitHub
UECA-Prompt: Universal Prompt for Emotion Cause Analysis（COLING 2022）
☆16Jun 6, 2023Updated 3 years ago
CG-Bench / CG-Bench
View on GitHub
☆20Jan 26, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
xufangzhi / Genius
View on GitHub
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆72Jun 1, 2025Updated last year
zihuixue / AlignEgoExo
View on GitHub
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆19Apr 5, 2024Updated 2 years ago
showlab / AUI
View on GitHub
Computer-Use Agents as Judges for Generative UI
☆44Nov 27, 2025Updated 7 months ago
XMUDeepLIT / AVG-LLaVA
View on GitHub
Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"
☆33Oct 12, 2024Updated last year
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
DAMO-NLP-SG / LLM-Multilingual-Knowledge-Boundaries
View on GitHub
[ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
☆19Oct 18, 2025Updated 9 months ago
Pascalson / LERG
View on GitHub
A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…
☆16Mar 21, 2022Updated 4 years ago