HICAI-ZJU / InstructProtein
Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction".
☆11Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for InstructProtein
- Exploring Evolution-aware & free protein language models as protein function predictors☆60Updated last month
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆37Updated 7 months ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023]☆31Updated last year
- Code for 'On Pre-trained Language Models For Antibody'☆30Updated last year
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆49Updated 11 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆87Updated last year
- This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.☆87Updated last year
- Open-Protein is an open source pre-training platform that supports multiple protein pre-training models and downstream tasks.☆17Updated last year
- ☆37Updated last year
- ☆43Updated 3 months ago
- ☆39Updated 6 months ago
- Code for "Learning Harmonic Molecular Representations on Riemannian Manifold", ICLR, 2023☆11Updated last year
- MSAGPT☆25Updated 5 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆95Updated last month
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆82Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning☆45Updated last year
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Updated 11 months ago
- SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction (Briefings in Bioinformatics 2023)☆48Updated 5 months ago
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆29Updated 3 months ago
- Must-read papers on NLP for science.☆55Updated last year
- A Protein Large Language Model for Multi-Task Protein Language Processing☆138Updated last month
- An official implementation of Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning☆25Updated last year
- The PyTorch implementation of MoMu, described in "Natural Language-informed Modeling of Molecule Graphs".☆21Updated last year
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆144Updated last year
- Source code for "A Deep-learning System Bridging Molecule Structure and Biomedical Text with Comprehension Comparable to Human Profession…☆83Updated last year
- The first large protein language model trained follows structure instructions.☆71Updated 5 months ago
- This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"☆93Updated 6 months ago
- Graph Denoising Diffusion for Inverse Protein Folding(NeurIPS 2023)☆56Updated 3 months ago
- Official implementation for Learning Invariant Molecular Representation in Latent Discrete Space (NeurIPS 2023)☆19Updated last year
- This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instru…☆3Updated this week