[ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts
☆100 · Oct 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for ProtST
Users that are interested in ProtST are comparing it to the libraries listed below
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training · ☆50 · Mar 14, 2024 · Updated 2 years ago
- PEER Benchmark, appeared at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096) · ☆100 · Mar 18, 2023 · Updated 3 years ago
- ☆39 · Jun 9, 2025 · Updated 9 months ago
- ☆214 · Sep 24, 2024 · Updated last year
- Saprot: Protein Language Model with Structural Alphabet (AA+3Di) · ☆572 · Mar 8, 2026 · Updated last week
- GearNet and Geometric Pretraining Methods for Protein Structure Representation Learning, ICLR'2023 (https://arxiv.org/abs/2203.06125) · ☆317 · Jun 13, 2025 · Updated 9 months ago
- A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z) · ☆102 · Jan 11, 2025 · Updated last year
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding · ☆151 · Mar 10, 2025 · Updated last year
- The official implementation of the ICLR'23 paper PiFold: Toward effective and efficient protein inverse folding. · ☆183 · Jun 17, 2023 · Updated 2 years ago
- ☆12 · Dec 2, 2024 · Updated last year
- This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instru… · ☆10 · Mar 27, 2025 · Updated 11 months ago
- ProteinChat: A frontier protein-language generative model designed to decode the molecular language of proteins. · ☆64 · Mar 7, 2025 · Updated last year
- Code for generating model and predictions for CAFA5 competition 2023 (4th place solution) · ☆18 · May 24, 2024 · Updated last year
- A Protein Large Language Model for Multi-Task Protein Language Processing · ☆209 · Sep 30, 2025 · Updated 5 months ago
- ESM-GearNet for Protein Structure Representation Learning (https://arxiv.org/abs/2303.06275) · ☆111 · Oct 23, 2023 · Updated 2 years ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023] · ☆33 · Aug 5, 2023 · Updated 2 years ago
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs · ☆77 · May 10, 2024 · Updated last year
- The official implementation of the NeurIPS'23 paper ProteinInvBench: Benchmarking Protein Design on Diverse Tasks, Models, and Metrics · ☆200 · Sep 18, 2024 · Updated last year
- Codebase of the paper "Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling" (ICLR 2024) · ☆81 · Mar 7, 2024 · Updated 2 years ago
- ☆92 · Mar 27, 2023 · Updated 2 years ago
- Diffusion models of protein structure; trigonometry and attention are all you need! · ☆565 · Dec 12, 2023 · Updated 2 years ago
- CLEAN: a contrastive learning model for high-quality functional prediction of proteins · ☆307 · Apr 6, 2025 · Updated 11 months ago
- Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction". · ☆23 · Aug 28, 2024 · Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning · ☆53 · Nov 1, 2023 · Updated 2 years ago
- Official repository for the ProteinGym benchmarks · ☆402 · Jan 12, 2026 · Updated 2 months ago
- InterLabelGO+: Unraveling label correlations in protein function prediction · ☆15 · Aug 5, 2025 · Updated 7 months ago
- Structure-conditioned masked language modeling for protein sequence design · ☆72 · Jan 31, 2024 · Updated 2 years ago
- Awesome Protein Representation Learning · ☆686 · Nov 16, 2024 · Updated last year
- Evolutionary Scale Modeling (esm): Pretrained language models for proteins · ☆4,003 · Feb 7, 2024 · Updated 2 years ago
- Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42… · ☆251 · Jun 27, 2025 · Updated 8 months ago
- Protein structure datasets for machine learning. · ☆115 · Apr 22, 2025 · Updated 10 months ago
- Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models … · ☆267 · Apr 27, 2025 · Updated 10 months ago
- This is an official implementation for "MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins". · ☆18 · Jan 4, 2025 · Updated last year
- Bilingual Language Model for Protein Sequence and Structure · ☆303 · Mar 6, 2026 · Updated 2 weeks ago
- Intrinsic-Extrinsic Convolution and Pooling for Learning on 3D Protein Structures · ☆49 · Jan 24, 2022 · Updated 4 years ago
- Zero-shot prediction of mutation effects on protein function with multimodal deep representation learning · ☆71 · Aug 13, 2025 · Updated 7 months ago
- A generative model for programmable protein design · ☆803 · Apr 11, 2024 · Updated last year
- ☆19 · Aug 5, 2024 · Updated last year
- Finetuning ProGen2 protein language model for generation of protein sequences from selected protein families. · ☆88 · Feb 3, 2025 · Updated last year