Yijia-Xiao / Protein-LLM-Survey
Large Language Models in Protein: A Comprehensive Survey
☆55Updated last month
Alternatives and similar repositories for Protein-LLM-Survey:
Users that are interested in Protein-LLM-Survey are comparing it to the libraries listed below
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆200Updated 4 months ago
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆60Updated last week
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆95Updated last year
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆108Updated 6 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆267Updated 4 months ago
- MSAGPT☆30Updated 4 months ago
- ☆28Updated 6 months ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆50Updated last year
- ☆35Updated last year
- The first large protein language model trained follows structure instructions.☆76Updated 9 months ago
- A Text-guided Protein Design Framework, Nat Mach Intell 2025☆56Updated 2 months ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆35Updated 7 months ago
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆88Updated 2 years ago
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆146Updated 2 weeks ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆44Updated last year
- Scientific Large Language Models: A Survey on Biological & Chemical Domains☆297Updated last month
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Updated last year
- Must-read papers on NLP for science.☆58Updated last year
- Code for ProSST: A Pre-trained Protein Sequence and Structure Transformer with Disentangled Attention.☆100Updated last month
- Official Implemetation of DPLM (ICML'24) - Diffusion Language Models Are Versatile Protein Learners☆135Updated 3 weeks ago
- [ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback☆153Updated 3 months ago
- ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning☆93Updated last week
- Making Protein Modeling Accessible to All Biologists☆102Updated this week
- A Protein Large Language Model for Multi-Task Protein Language Processing☆170Updated last month
- A Biological Foundation Model Bridging the Gap between Molecular Sequences Through Central Dogma☆23Updated 2 weeks ago
- This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"☆102Updated last month
- Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42…☆222Updated 2 months ago
- ☆73Updated 3 months ago
- Retrieved Sequence Augmentation for Protein Representation Learning☆50Updated last year
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆32Updated 2 months ago