baaihealth / opi
This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instruction dataset with which LLMs can be adapted to protein-related tasks via instruction tuning and evaluated on these tasks.
☆3Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for opi
- Exploring Evolution-aware & free protein language models as protein function predictors☆60Updated last month
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆96Updated 2 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆37Updated 8 months ago
- Code for 'On Pre-trained Language Models For Antibody'☆30Updated last year
- A comprehensive repository dedicated to the collection and exploration of studies utilizing Large Language Models for molecular design, p…☆40Updated last year
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆49Updated last year
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆88Updated last year
- ☆49Updated 5 months ago
- A Text-guided Protein Design Framework, Nat Mach Intell 2024☆43Updated 4 months ago
- ☆31Updated 8 months ago
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆82Updated last year
- This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"☆93Updated 7 months ago
- Generative Language Modeling for Antibody Design☆130Updated last month
- The first large protein language model trained follows structure instructions.☆71Updated 5 months ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023]☆32Updated last year
- SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction (Briefings in Bioinformatics 2023)☆48Updated 5 months ago
- ESM-GearNet for Protein Structure Representation Learning (https://arxiv.org/abs/2303.06275)☆79Updated last year
- ☆66Updated last year
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023☆35Updated 2 months ago
- [ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback☆131Updated 6 months ago
- ☆126Updated 2 years ago
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆144Updated last year
- ☆37Updated last year
- Code implementation of "Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem" https://arxiv.org/…