[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models
☆294Oct 28, 2024Updated last year
Alternatives and similar repositories for Mol-Instructions
Users that are interested in Mol-Instructions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback☆195Dec 17, 2024Updated last year
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆260Mar 5, 2026Updated 3 months ago
- Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42…☆258Jun 27, 2025Updated last year
- Associated Repository for "Translation between Molecules and Natural Language"☆194Sep 15, 2023Updated 2 years ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆173Jul 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆127Sep 14, 2024Updated last year
- Code for the paper Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language☆115Feb 26, 2026Updated 4 months ago
- LLM for Drug Editing, ICLR 2024☆160May 28, 2024Updated 2 years ago
- Code and data for the Nature Machine Intelligence paper "Knowledge graph-enhanced molecular contrastive learning with functional prompt".☆138Mar 18, 2024Updated 2 years ago
- ☆23Oct 11, 2022Updated 3 years ago
- ☆1,083Updated this week
- Official Repository for the Uni-Mol Series Methods☆1,121May 29, 2025Updated last year
- ☆259May 17, 2024Updated 2 years ago
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning☆66Feb 22, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Must-read papers on NLP for science.☆57Jun 19, 2023Updated 3 years ago
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Jul 17, 2023Updated 2 years ago
- Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality …☆113Jun 9, 2025Updated last year
- ☆53May 24, 2024Updated 2 years ago
- [IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models☆540Jun 17, 2023Updated 3 years ago
- ☆53Apr 19, 2024Updated 2 years ago
- A Protein Large Language Model for Multi-Task Protein Language Processing☆207Sep 30, 2025Updated 9 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆85Feb 25, 2024Updated 2 years ago
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Downloads USPTO patents and finds molecules related to keyword queries☆73Dec 8, 2023Updated 2 years ago
- The official implementation of the NeurIPS'23 paper ProteinInvBench: Benchmarking Protein Design on Diverse Tasks, Models, and Metrics☆202Sep 18, 2024Updated last year
- Scientific Large Language Models: A Survey on Biological & Chemical Domains☆358Sep 7, 2025Updated 9 months ago
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆54Dec 2, 2024Updated last year
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆20Jan 2, 2026Updated 5 months ago
- The PyTorch implementation of MoMu, described in "Natural Language-informed Modeling of Molecule Graphs".☆29Jul 17, 2023Updated 2 years ago
- Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models …☆274Apr 27, 2025Updated last year
- overview of datasets for ML in chemistry☆412Oct 22, 2025Updated 8 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆104Oct 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023☆49Sep 9, 2024Updated last year
- Code and Data for the paper: Molecular Contrastive Learning with Chemical Element Knowledge Graph [AAAI 2022]☆91Feb 3, 2024Updated 2 years ago
- Chemcrow☆931Dec 19, 2024Updated last year
- [ICLR 2024] MARCEL: Machine Learning over Molecular Conformer Ensembles☆46Jun 14, 2023Updated 3 years ago
- Saprot: Protein Language Model with Structural Alphabet (AA+3Di)☆607Mar 8, 2026Updated 3 months ago
- Reaction-Conditioned Virtual Screening of Enzymes☆45Jun 13, 2026Updated 2 weeks ago
- GeoSSL: Molecular Geometry Pretraining with SE(3)-Invariant Denoising Distance Matching, ICLR'23 (https://openreview.net/forum?id=CjTHVo1…☆48Jul 27, 2023Updated 2 years ago