IDEA-XL / PRESTO
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]
☆28 Updated 7 months ago
Alternatives and similar repositories for PRESTO
Users who are interested in PRESTO are comparing it to the repositories listed below
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025) ☆47 Updated 7 months ago
- Collection of latest papers and materials in the area of RLVR! ☆16 Updated last month
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆27 Updated last month
- ☆46 Updated last year
- Part of official implementation of "Natural language-informed learning of molecule graphs" ☆18 Updated 2 years ago
- What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks ☆152 Updated 11 months ago
- ☆45 Updated 9 months ago
- Must-read papers on NLP for science. ☆58 Updated 2 years ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆22 Updated last year
- Structured Chemistry Reasoning with Large Language Models ☆40 Updated last year
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings) ☆114 Updated 10 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models ☆280 Updated 8 months ago
- Retrieved Sequence Augmentation for Protein Representation Learning ☆53 Updated last year
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses> ☆46 Updated last week
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries ☆45 Updated 3 months ago
- Pre-trained Language Model for Scientific Text ☆45 Updated last year
- [AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research ☆27 Updated 11 months ago
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning ☆62 Updated last year
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as… ☆30 Updated 5 months ago
- ☆17 Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆45 Updated 7 months ago
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model ☆26 Updated 3 weeks ago
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle ☆20 Updated 2 years ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training ☆48 Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆37 Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆20 Updated 2 years ago
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs ☆74 Updated last year
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science ☆20 Updated 9 months ago
- Official implementation for Learning Invariant Molecular Representation in Latent Discrete Space (NeurIPS 2023) ☆21 Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆26 Updated last year