HKUNLP / RSA
Retrieved Sequence Augmentation for Protein Representation Learning
☆50Updated last year
Alternatives and similar repositories for RSA:
Users that are interested in RSA are comparing it to the libraries listed below
- A Text-guided Protein Design Framework, Nat Mach Intell 2025☆56Updated 2 months ago
- Code for 'On Pre-trained Language Models For Antibody'☆33Updated 2 years ago
- Exploring Evolution-aware & free protein language models as protein function predictors☆63Updated 5 months ago
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆44Updated last year
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆88Updated 2 years ago
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆50Updated last year
- NeurIPS 2023 Spotlight paper: Full atom protein pocket design via iterative refinement☆46Updated last year
- A comprehensive benchmark on the performances of multiple protein backbone generative models.☆53Updated this week
- Must-read papers on NLP for science.☆58Updated last year
- ☆40Updated 2 years ago
- Source code of the paper "Protein Sequence and Structure Co-Design with Equivariant Translation"☆23Updated last year
- MSAGPT☆30Updated 4 months ago
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆32Updated 2 months ago
- The first large protein language model trained follows structure instructions.☆75Updated 9 months ago
- Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"☆39Updated 2 weeks ago
- Code and data for the ACL2024 paper "InstructProtein: Aligning Human and Protein Language via Knowledge Instruction".☆17Updated 6 months ago
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as…☆27Updated 2 months ago
- This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.☆91Updated last year
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023☆37Updated 6 months ago
- This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instru…☆4Updated last week
- Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer, NeurIPS2024☆25Updated 5 months ago
- Official repository of ReactZyme☆27Updated 5 months ago
- ☆28Updated 2 weeks ago
- An official implementation of Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning☆27Updated last year
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆94Updated last year
- Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion☆56Updated 10 months ago
- ☆10Updated 11 months ago
- diffusion model for protein sequence generation☆49Updated 2 years ago
- ☆28Updated 6 months ago
- ☆66Updated last month