ocx-lab / opiLinks
This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instruction dataset with which LLMs can be adapted to protein-related tasks via instruction tuning and evaluated on these tasks.
☆10Updated 8 months ago
Alternatives and similar repositories for opi
Users that are interested in opi are comparing it to the libraries listed below
Sorting:
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆122Updated last year
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆50Updated last year
- Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42…☆249Updated 5 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆99Updated 2 years ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆289Updated last year
- LLM for Drug Editing, ICLR 2024☆155Updated last year
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆149Updated 9 months ago
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆238Updated 2 weeks ago
- ☆39Updated 6 months ago
- Must-read papers on NLP for science.☆56Updated 2 years ago
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆99Updated 2 years ago
- ☆51Updated last year
- ☆51Updated last year
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆53Updated 2 years ago
- [ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback☆182Updated last year
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆70Updated 3 weeks ago
- Code for the paper Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language☆106Updated last year
- ☆42Updated 3 years ago
- A comprehensive repository dedicated to the collection and exploration of studies utilizing Large Language Models for molecular design, p…☆43Updated 2 years ago
- The first large protein language model trained follows structure instructions.☆90Updated 7 months ago
- Code for 'On Pre-trained Language Models For Antibody'☆32Updated 2 years ago
- A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z)☆98Updated 11 months ago
- Llamole: Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning☆37Updated last year
- Exploring Evolution-aware & free protein language models as protein function predictors☆63Updated last year
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023☆45Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning☆53Updated 2 years ago
- Large Language Models in Protein: A Comprehensive Survey☆162Updated 8 months ago
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Updated 2 years ago
- [Briefings in Bioinformatics] A Survey of Generative AI for de novo Drug Design☆94Updated last year
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆166Updated last year