HKUNLP / RSALinks
Retrieved Sequence Augmentation for Protein Representation Learning
☆53Updated last year
Alternatives and similar repositories for RSA
Users that are interested in RSA are comparing it to the libraries listed below
Sorting:
- Exploring Evolution-aware & free protein language models as protein function predictors☆63Updated 10 months ago
- Code for 'On Pre-trained Language Models For Antibody'☆33Updated 2 years ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆116Updated 10 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆49Updated last year
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆92Updated 2 years ago
- The first large protein language model trained follows structure instructions.☆84Updated 2 months ago
- A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z)☆90Updated 6 months ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆53Updated last year
- Source code of the paper "Protein Sequence and Structure Co-Design with Equivariant Translation"☆24Updated last year
- ☆42Updated 2 years ago
- Official implementation of "Learning the language of protein structures"☆37Updated last month
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as…☆30Updated 6 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆98Updated last year
- A comprehensive benchmark on the performances of multiple protein backbone generative models.☆62Updated 2 months ago
- NeurIPS 2023 Spotlight paper: Full atom protein pocket design via iterative refinement☆47Updated last year
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆33Updated 2 months ago
- Must-read papers on NLP for science.☆58Updated 2 years ago
- [ICML2025] The official implementation of "WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State…☆17Updated 2 months ago
- An official implementation of Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning☆29Updated 2 years ago
- Evolutionary Algorithm with Diffusion Models for Protein Design☆25Updated 5 months ago
- ☆17Updated last year
- Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer, NeurIPS2024☆27Updated 2 months ago
- Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax☆113Updated 3 years ago
- ☆58Updated last year
- Implementation of the DDPM + IPA (invariant point attention) for protein generation, as outlined in the paper "Protein Structure and Sequ…☆89Updated 3 years ago
- MSAGPT☆35Updated 8 months ago
- Protein Design by Machine Learning guided Directed Evolution☆44Updated 4 months ago
- A Modular Architecture for Deep Learning Systems☆43Updated 2 months ago
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆67Updated 4 months ago
- Official repository for "Plug & Play Directed Evolution for Proteins with Gradient-Based Discrete MCMC"☆12Updated 2 years ago