IDEA-XL / PRESTOLinks
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]
☆28Updated 8 months ago
Alternatives and similar repositories for PRESTO
Users that are interested in PRESTO are comparing it to the libraries listed below
Sorting:
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆47Updated 8 months ago
- What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks☆155Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆45Updated 8 months ago
- ☆47Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆27Updated 3 weeks ago
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Updated 2 years ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆116Updated 10 months ago
- ☆17Updated last year
- Collection of latest papers and materials in the area of RLVR!☆22Updated last month
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆283Updated 9 months ago
- Must-read papers on NLP for science.☆58Updated 2 years ago
- Code for AAAI24 paper Text-Guided Molecule Generation with Diffusion Language Model☆27Updated last month
- ☆47Updated 9 months ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆46Updated 4 months ago
- Pre-trained Language Model for Scientific Text☆45Updated last year
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning☆62Updated last year
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as…☆30Updated 6 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆46Updated last month
- Structured Chemistry Reasoning with Large Language Models☆40Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning☆53Updated last year
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Updated last year
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction.☆21Updated last year
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules"☆38Updated last year
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆67Updated 4 months ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Updated last year
- [ICML2025] The official implementation of "WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State…☆17Updated 2 months ago
- [ICML 2023] FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning☆21Updated 9 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆49Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆20Updated 2 years ago
- [CIKM2023] The official implementation of "MPerformer: An SE(3) Transformer-based Molecular Perceptron"☆20Updated 8 months ago