Leezekun/MMSci

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

☆51

Alternatives and similar repositories for MMSci

Users who are interested in MMSci are comparing it to the repositories listed below.

cuixing100876 / InstaStyle
☆15 · Jul 24, 2024 · Updated 2 years ago
KyGao / Mol-StrucTok
Serializing molecule 3D structures
☆14 · Nov 27, 2024 · Updated last year
SciMT / SciMT-benchmark
☆11 · Jan 3, 2024 · Updated 2 years ago
drorlab / GLOW_IVES
☆15 · Dec 4, 2023 · Updated 2 years ago
pranonrahman / ChartSumm
ChartSumm is a large-scale benchmark for automatic chart-to-text summarization
☆11 · Jul 20, 2023 · Updated 3 years ago
GAIR-NLP / Preference-Dissection
☆25 · May 16, 2024 · Updated 2 years ago
wangwangwang23333 / OS-PageSwapManagement
Operating system memory management project
☆14 · Jun 5, 2021 · Updated 5 years ago
declare-lab / LLM-PuzzleTest
This repository is maintained to release dataset and models for multimodal puzzle reasoning.
☆117 · Feb 26, 2025 · Updated last year
deepmodeling / SciAssess
SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various…
☆89 · May 21, 2025 · Updated last year
markendo / downscaling_intelligence
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
☆25 · Mar 21, 2026 · Updated 4 months ago
apple / ml-mia-bench
This repo contains code and data for the ICLR 2025 paper "MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs"
☆38 · Mar 9, 2025 · Updated last year
IBM / KVP10k
Repository for the KVP10k dataset
☆23 · Sep 18, 2025 · Updated 10 months ago
LaVi-Lab / Visual-Table
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20 · Oct 17, 2024 · Updated last year
yeung-lab / Micro-Bench
A Vision-Language Benchmark for Microscopy Understanding
☆31 · Mar 13, 2025 · Updated last year
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25 · May 14, 2026 · Updated 2 months ago
Victorwz / LLaVA-Unified
☆23 · Aug 27, 2025 · Updated 11 months ago
google / spiqa
Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]
☆76 · Jan 13, 2025 · Updated last year
ruili33 / TPO
☆41 · Sep 9, 2025 · Updated 10 months ago
zyh-uaiaaaa / Generalizable-FER
Official implementation of the ECCV 2024 paper: Generalizable Facial Expression Recognition
☆23 · Sep 20, 2024 · Updated last year
OSU-slatelab / MapQA
☆15 · Jan 9, 2026 · Updated 6 months ago
liuxuannan / MMFakeBench
[ICLR 2025] MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
☆55 · Mar 25, 2025 · Updated last year
vis-nlp / UniChart
☆88 · Aug 18, 2024 · Updated last year
dptech-corp / UniParser-Tools
UniParser-Tools: SDKs, Utilities, and Post-Processing for Industrial-Grade Multi-Modal PDF Parsing
☆22 · Updated this week
multimodal-art-projection / COIG-P
☆42 · Jul 15, 2025 · Updated last year
QizhiPei / BioT5
BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)
☆127 · Sep 14, 2024 · Updated last year
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
HICAI-ZJU / SciKnowEval
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
☆30 · Jul 13, 2025 · Updated last year
phenixace / TOMG-Bench
☆23 · Feb 3, 2026 · Updated 5 months ago
adlnlp / mmvqa
☆19 · Sep 11, 2024 · Updated last year
locuslab / llava-token-compression
☆47 · Nov 8, 2024 · Updated last year
hnlab / BindingNet
☆27 · Jul 3, 2024 · Updated 2 years ago
AI4Chem / ChemistryAgent
☆27 · Jun 11, 2025 · Updated last year
MiuLab / FactAlign
Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"
☆19 · Oct 3, 2024 · Updated last year
zhongyy / VIoTGPT
Code for the AAAI 2025 paper "VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things"
☆16 · Jan 16, 2025 · Updated last year
czbiohub-sf / Organelle_IP_analyses_and_figures
Jupyter notebooks for analysis and figures related to the native organelle IP paper
☆14 · Mar 10, 2026 · Updated 4 months ago
guanjq / LinkerNet
The official implementation of LinkerNet: Fragment Poses and Linker Co-Design with 3D Equivariant Diffusion (NeurIPS 2023 Spotlight)
☆19 · Feb 23, 2024 · Updated 2 years ago
eric-haibin-lin / verl-community
☆41 · Dec 7, 2025 · Updated 7 months ago
PKU-Alignment / safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…
☆35 · Aug 20, 2024 · Updated last year
Victorwz / tod_as_nlg
Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".
☆14 · Apr 6, 2022 · Updated 4 years ago