FudanDISC/ReForm-Eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FudanDISC/ReForm-Eval)

FudanDISC / ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

☆46

Alternatives and similar repositories for ReForm-Eval

Users that are interested in ReForm-Eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FudanDISC / weakly-supervised-mVLP
View on GitHub
Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua…
☆19Jul 5, 2023Updated 3 years ago
FudanDISC / DISCOpen-MVPTR
View on GitHub
pytorch implementation of mvp: a multi-stage vision-language pre-training framework
☆11Apr 23, 2022Updated 4 years ago
FudanDISC / DISCOpen-MedBox-DialoDiagnosis
View on GitHub
An Open-Source Tool for Automatic Disease Diagnosis..
☆26May 15, 2022Updated 4 years ago
FudanDISC / SocialAgent
View on GitHub
A collection of resources that investigate social agents.
☆241Apr 22, 2025Updated last year
FudanDISC / DISC-LawLLM
View on GitHub
[中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.
☆943May 27, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
FudanDISC / DISC-FinLLM
View on GitHub
DISC-FinLLM，中文金融大语言模型（LLM），旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide us…
☆890Nov 1, 2023Updated 2 years ago
archiki / RepARe
View on GitHub
☆21Oct 10, 2023Updated 2 years ago
mshukor / ima-lmms
View on GitHub
[NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
☆23Oct 15, 2024Updated last year
yangbang18 / MultiCapCLIP
View on GitHub
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
☆36Aug 8, 2024Updated last year
FudanDISC / SocioVerse
View on GitHub
☆205Feb 8, 2026Updated 5 months ago
RUCAIBox / ComVint
View on GitHub
The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…
☆19Nov 10, 2023Updated 2 years ago
Lishi905 / SoMeLVLM
View on GitHub
Repository for SoMeLVLM: A Large Vision Language Model for Social Media Processing
☆14Oct 9, 2025Updated 9 months ago
HYPJUDY / Sparkles
View on GitHub
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
☆45Jun 14, 2024Updated 2 years ago
ljcleo / agent_sense
View on GitHub
Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
☆13Jan 4, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
RUCAIBox / CIR
View on GitHub
☆16Nov 11, 2025Updated 8 months ago
XuqianRen / Semantic-guided-Multi-mask-Image-Harmonization
View on GitHub
The official code of the paper: Semantic-guided Multi-mask Image Harmonization (ECCV2022)
☆15Jul 20, 2022Updated 4 years ago
FaltingsA / SSM
View on GitHub
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
☆10Aug 10, 2025Updated 11 months ago
tianyi-lab / HallusionBench
View on GitHub
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆342Oct 14, 2025Updated 9 months ago
umd-huang-lab / Mementos
View on GitHub
☆32Feb 8, 2024Updated 2 years ago
RupertLuo / Valley
View on GitHub
The official repository of "Video assistant towards large language model makes everything easy"
☆232Dec 24, 2024Updated last year
OpenKG-ORG / EasyDetect
View on GitHub
An Easy-to-use Hallucination Detection Framework for LLMs.
☆64Apr 21, 2024Updated 2 years ago
Rshcaroline / FDU-Natural-Language-Processing
View on GitHub
This is a repo including all projects in my Introduction to Natural Language Processing course (DATA130006) in School of Data Science @Fu…
☆59Jul 30, 2019Updated 6 years ago
zhongTao99 / ollama
View on GitHub
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
☆23Jun 17, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chaudatascience / cipher_multiagent_debate
View on GitHub
Let Models Speak Ciphers: Multiagent Debate through Embeddings
☆17Feb 17, 2024Updated 2 years ago
princetonvisualai / pointingqa
View on GitHub
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
qiwang067 / CoWorld
View on GitHub
[NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…
☆28Oct 24, 2025Updated 8 months ago
ChenXiaoFei-CS / KoBo
View on GitHub
Official implementation of MICCAI2023【Knowledge Boosting: Rethinking Medical Contrastive Vision-Langauge Pre-training】
☆16Mar 19, 2024Updated 2 years ago
zchuz / TimeBench
View on GitHub
The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"
☆36Jun 29, 2024Updated 2 years ago
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
BAAI-DCAI / MMVU
View on GitHub
☆57Mar 19, 2025Updated last year
FudanDISC / DISC-MedLLM
View on GitHub
Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful me…
☆564Oct 28, 2023Updated 2 years ago
dahyun-kang / cub-200-2011-part-visualizer
View on GitHub
Visualization tool for CUB-200-2011 part keypoints (Wah et al.).
☆10Sep 17, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZhangXu0963 / VSL
View on GitHub
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15Dec 25, 2023Updated 2 years ago
gbc-iitd / US_UCL
View on GitHub
[MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos
☆11May 28, 2023Updated 3 years ago
Q-Future / Q-Bench
View on GitHub
①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and vi…
☆287Aug 12, 2024Updated last year
zzzhr97 / SpecBench
View on GitHub
Code repository for the ICML 2026 paper "Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation".
☆24Jun 14, 2026Updated last month
kevinyaobytedance / llm_eval
View on GitHub
LLM evaluation.
☆16Nov 7, 2023Updated 2 years ago
Junction4Nako / mvp_pytorch
View on GitHub
pytorch implementation of mvp: a multi-stage vision-language pre-training framework
☆35Mar 1, 2023Updated 3 years ago
pkunlp-icler / PCA-EVAL
View on GitHub
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆107Mar 14, 2024Updated 2 years ago