IDEA-XL / PRESTO
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]
☆23Updated 3 months ago
Alternatives and similar repositories for PRESTO:
Users that are interested in PRESTO are comparing it to the libraries listed below
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆43Updated 2 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆21Updated 7 months ago
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆15Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆38Updated last year
- ☆42Updated 10 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆94Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆18Updated last year
- Must-read papers on NLP for science.☆58Updated last year
- ☆16Updated 8 months ago
- What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks☆137Updated 6 months ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆39Updated 2 months ago
- Pre-trained Language Model for Scientific Text☆44Updated last year
- ☆21Updated 2 years ago
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆66Updated 2 months ago
- [ICML 2023] FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning☆21Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆44Updated 3 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆36Updated 3 weeks ago
- [ICML 2024] Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation☆22Updated 5 months ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Updated last year
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆42Updated last year
- Official implementation for Learning Invariant Molecular Representation in Latent Discrete Space (NeurIPS 2023)☆22Updated last year
- ☆11Updated last year
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆103Updated 5 months ago
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules"☆33Updated 11 months ago
- Official implementation for the paper "Learning Substructure Invariance for Out-of-Distribution Molecular Representations" (NeurIPS 2022)…☆61Updated 2 years ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆54Updated 9 months ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆25Updated 10 months ago
- Codes for Merging Large Language Models☆29Updated 6 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆49Updated 4 months ago
- Code and Data Repo for [NeurIPS 2024] Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆22Updated 8 months ago