baaihealth / opi
This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instruction dataset with which LLMs can be adapted to protein-related tasks via instruction tuning and evaluated on these tasks.
☆6Updated 3 months ago
Alternatives and similar repositories for opi
Users who are interested in opi are comparing it to the libraries listed below
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆114Updated 9 months ago
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆147Updated 3 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆48Updated last year
- A comprehensive repository dedicated to the collection and exploration of studies utilizing Large Language Models for molecular design, p…☆41Updated last year
- A Biological Foundation Model Bridging the Gap between Molecular Sequences Through Central Dogma☆27Updated 3 months ago
- Exploring Evolution-aware & free protein language models as protein function predictors☆63Updated 9 months ago
- The first large protein language model trained to follow structure instructions.☆81Updated last month
- ☆50Updated last year
- Must-read papers on NLP for science.☆58Updated 2 years ago
- ☆34Updated 2 weeks ago
- ☆40Updated last year
- Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer, NeurIPS2024☆26Updated last month
- LLM for Drug Editing, ICLR 2024☆149Updated last year
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Updated last year
- ☆71Updated 3 weeks ago
- PEER Benchmark, appeared at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆89Updated 2 years ago
- Code for 'On Pre-trained Language Models For Antibody'☆33Updated 2 years ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆51Updated last year
- A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z)☆86Updated 5 months ago
- This repo contains the results data for Round 1 of Adaptyv Bio’s EGFR Protein Design Competition.☆34Updated 6 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆97Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning☆52Updated last year
- ☆58Updated last year
- Protein Structure Transformer (PST): Endowing pretrained protein language models with structural knowledge☆42Updated 8 months ago
- Large Language Models in Protein: A Comprehensive Survey☆92Updated 2 months ago
- ☆42Updated 2 years ago
- ESM-GearNet for Protein Structure Representation Learning (https://arxiv.org/abs/2303.06275)☆102Updated last year
- MSAGPT☆34Updated 7 months ago
- Computational predictor of protein intrinsic disorder and its functions☆10Updated last year
- This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"☆110Updated 4 months ago