IDEA-XL / PRESTO
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]
☆28 Updated last year
Alternatives and similar repositories for PRESTO
Users who are interested in PRESTO are comparing it to the repositories listed below:
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025) ☆52 Updated last year
- ☆52 Updated last year
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023) ☆167 Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆35 Updated 5 months ago
- ☆51 Updated last year
- Pre-trained Language Model for Scientific Text ☆45 Updated last year
- Must-read papers on NLP for science. ☆56 Updated 2 years ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings) ☆123 Updated last year
- Structured Chemistry Reasoning with Large Language Models ☆39 Updated last year
- Part of official implementation of "Natural language-informed learning of molecule graphs" ☆18 Updated 2 years ago
- ☆17 Updated last year
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as… ☆31 Updated 11 months ago
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction. ☆22 Updated last year
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses> ☆50 Updated last month
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules" ☆40 Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning ☆53 Updated 2 years ago
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model ☆29 Updated 6 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆22 Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆50 Updated last year
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training ☆50 Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter". ☆12 Updated 2 years ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models ☆291 Updated last year
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp… ☆70 Updated last month
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning ☆65 Updated last year
- [ICML 2023] FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning ☆20 Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆21 Updated 2 years ago
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023 ☆45 Updated last year
- [COLING 2025] A curated paper list about LLMs for chemistry ☆128 Updated last month
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks). ☆14 Updated 2 years ago
- Official implementation for Learning Invariant Molecular Representation in Latent Discrete Space (NeurIPS 2023) ☆22 Updated 2 years ago