baaihealth / opi
This repo hosts the Open Protein Instructions (OPI) project, which aims to build and release a high-quality, comprehensive protein instruction dataset for adapting LLMs to protein-related tasks via instruction tuning and for evaluating them on those tasks.
☆15 · Updated last week
Related projects:
- A comprehensive repository dedicated to the collection and exploration of studies utilizing Large Language Models for molecular design, p… ☆40 · Updated last year
- Exploring Evolution-aware & free protein language models as protein function predictors ☆57 · Updated last year
- ☆40 · Updated 2 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training ☆34 · Updated 6 months ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design" ☆48 · Updated 10 months ago
- Code for 'On Pre-trained Language Models For Antibody' ☆30 · Updated last year
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts ☆83 · Updated 11 months ago
- ☆46 · Updated 3 months ago
- ☆38 · Updated last year
- The first large protein language model trained to follow structure instructions. ☆65 · Updated 3 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings) ☆89 · Updated last week
- ☆26 · Updated 6 months ago
- PEER Benchmark, appeared at the NeurIPS 2022 Datasets and Benchmarks Track (https://arxiv.org/abs/2206.02096) ☆79 · Updated last year
- SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction (Briefings in Bioinformatics 2023) ☆47 · Updated 3 months ago
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding ☆141 · Updated last year
- ESM-GearNet for Protein Structure Representation Learning (https://arxiv.org/abs/2303.06275) ☆70 · Updated 10 months ago
- This repo contains the code for our paper Conditional Antibody Design as 3D Equivariant Graph Translation. ☆84 · Updated last year
- Code implementation of "Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem" https://arxiv.org/… ☆66 · Updated last year
- Code for the paper Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language ☆85 · Updated last week
- This repo contains the code for our paper "End-to-End Full-Atom Antibody Design" ☆89 · Updated 5 months ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023] ☆29 · Updated last year
- [NeurIPS 2023] DrugCLIP: Contrastive Protein-Molecule Representation Learning for Virtual Screening ☆59 · Updated 4 months ago
- Code for ProSST: A Pre-trained Protein Sequence and Structure Transformer with Disentangled Attention. ☆32 · Updated 3 months ago
- [ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback ☆124 · Updated 4 months ago
- ☆76 · Updated this week
- ☆53 · Updated 3 months ago
- Generative Language Modeling for Antibody Design ☆116 · Updated 11 months ago
- Bib'23: Improved the Heterodimer Protein Complex Prediction with Protein Language Models ☆14 · Updated 9 months ago
- Implementation for ICML 2024 paper "MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space" ☆59 · Updated last month
- ☆21 · Updated this week