Leezekun / MMSci
MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension
☆24Updated last month
Related projects ⓘ
Alternatives and complementary repositories for MMSci
- Pre-trained Language Model for Scientific Text☆42Updated 9 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆36Updated last year
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"☆40Updated last month
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes☆19Updated this week
- Structured Chemistry Reasoning with Large Language Models☆31Updated 6 months ago
- The Open Source Code for LLM4SD (Large Language Models for Scientific Synthesis, Inference and Explanation)☆30Updated 3 weeks ago
- Holistic evaluation of multimodal foundation models☆41Updated 3 months ago
- ☆28Updated last month
- Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs☆10Updated 9 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆36Updated last week
- Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Lan…☆34Updated 4 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆63Updated 9 months ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆14Updated 3 weeks ago
- ☆11Updated 10 months ago
- ☆21Updated this week
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆44Updated 3 months ago
- ☆41Updated 7 months ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆37Updated 9 months ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆23Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆28Updated 2 weeks ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆69Updated last month
- MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆53Updated 2 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆12Updated 2 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆96Updated 2 months ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆33Updated last year
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆17Updated 3 months ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆53Updated 3 months ago
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆64Updated last year
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆13Updated last month