HKUNLP / RSA
Retrieved Sequence Augmentation for Protein Representation Learning
☆45Updated last year
Related projects ⓘ
Alternatives and complementary repositories for RSA
- Exploring Evolution-aware & free protein language models as protein function predictors☆60Updated last month
- Code for 'On Pre-trained Language Models For Antibody'☆30Updated last year
- [ACL 2024] ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training☆37Updated 7 months ago
- PEER Benchmark, appear at NeurIPS 2022 Dataset and Benchmark Track (https://arxiv.org/abs/2206.02096)☆82Updated last year
- MSAGPT☆25Updated 4 months ago
- Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023☆35Updated 2 months ago
- NeurIPS 2023 Spotlight paper: Full atom protein pocket design via iterative refinement☆44Updated last year
- PyTorch code for KDD 2023 paper "Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design"☆49Updated 11 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆95Updated last month
- Source code of the paper "Protein Sequence and Structure Co-Design with Equivariant Translation"☆22Updated last year
- Must-read papers on NLP for science.☆55Updated last year
- Source code of PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications.☆29Updated 3 months ago
- The first large protein language model trained follows structure instructions.☆71Updated 5 months ago
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆86Updated last year
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as…☆26Updated 3 months ago
- ☆42Updated 3 months ago
- ☆37Updated last year
- [RECOMB 2023] Official implementation of "Pisces: A combo-wise contrastive learning approach to synergistic drug combination prediction".☆14Updated 11 months ago
- This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instru…☆3Updated this week
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023]☆31Updated last year
- ☆48Updated 5 months ago
- ☆39Updated 6 months ago
- Official repository for "Plug & Play Directed Evolution for Proteins with Gradient-Based Discrete MCMC"☆11Updated last year
- Official Implemetation of DPLM (ICML'24) - Diffusion Language Models Are Versatile Protein Learners☆72Updated 3 weeks ago
- A comprehensive repository dedicated to the collection and exploration of studies utilizing Large Language Models for molecular design, p…☆40Updated last year
- Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"☆24Updated 3 weeks ago
- Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion☆51Updated 6 months ago
- A comprehensive benchmark on the performances of multiple protein backbone generative models.☆31Updated this week
- [ICLR 2023] "HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing" by Tianlong Chen*, Chengyue Gong*, Daniel …☆27Updated last year