This is the official repository of the Prot2Token paper.
☆38 · Jun 6, 2025 · Updated 8 months ago
Alternatives and similar repositories for prot2token
Users who are interested in prot2token are comparing it to the libraries listed below.
- ☆21 · Sep 2, 2025 · Updated 6 months ago
- scBSP is a specialized package designed for processing biological data, specifically in the analysis of gene expression and cell coordina… ☆22 · Feb 3, 2026 · Updated 3 weeks ago
- Official repo for scPEFT ☆51 · Jan 14, 2026 · Updated last month
- S-PLM: Structure-aware Protein Language Model via Contrastive Learning between Sequence and Structure ☆74 · Dec 23, 2025 · Updated 2 months ago
- This is the official repository of GCP-VQVAE: A Geometry-Complete Language for Protein 3D Structure ☆39 · Feb 13, 2026 · Updated 2 weeks ago
- The first multimodal QA dataset specifically designed for evaluating large TCM language models. ☆21 · Oct 24, 2025 · Updated 4 months ago
- ☆16 · Sep 15, 2025 · Updated 5 months ago
- Diffusion-based generative drug-like molecular editing with chemical natural language ☆18 · Dec 22, 2024 · Updated last year
- Serializing molecule 3D structures ☆14 · Nov 27, 2024 · Updated last year
- Contrastive learning harmonizing protein language models and natural language models ☆39 · Jun 12, 2024 · Updated last year
- 🧬 Fusion of protein sequence and structural information, using denoising pre-training network for zero-shot protein engineering (eLife 2… ☆82 · May 16, 2025 · Updated 9 months ago
- ☆39 · Jun 9, 2025 · Updated 8 months ago
- A conditionally adapted protein language model for the generation of enzymes ☆23 · Nov 26, 2024 · Updated last year
- ☆12 · Nov 26, 2023 · Updated 2 years ago
- Source code for ACL 2024 paper: "ProtT3: Protein-to-Text Generation for Text-based Protein Understanding" ☆52 · May 27, 2024 · Updated last year
- Workflow used to prepare the PPB-Affinity database ☆28 · Aug 22, 2024 · Updated last year
- Implementation of the Pairformer model used in AlphaFold 3 ☆14 · Feb 23, 2026 · Updated last week
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model" ☆14 · Jul 8, 2025 · Updated 7 months ago
- Autoregressive fragment-based diffusion for target-aware ligand design ☆32 · May 23, 2024 · Updated last year
- A Protein Large Language Model for Multi-Task Protein Language Processing ☆209 · Sep 30, 2025 · Updated 5 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter". ☆12 · Dec 27, 2023 · Updated 2 years ago
- ☆12 · Jul 2, 2025 · Updated 8 months ago
- Deep Learning model for protein and ligand complex structure prediction from sequences and SMILES ☆12 · Oct 31, 2023 · Updated 2 years ago
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as… ☆30 · Jan 23, 2025 · Updated last year
- Implementation of the Confidence Bootstrapping procedure for protein-ligand docking. ☆27 · Feb 29, 2024 · Updated 2 years ago
- Ligand-binding site classification with deep graph neural networks. ☆10 · Sep 24, 2023 · Updated 2 years ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023] ☆33 · Aug 5, 2023 · Updated 2 years ago
- The first large protein language model trained to follow structure instructions. ☆94 · May 20, 2025 · Updated 9 months ago
- This study explored prompt-based learning to adapt the state-of-the-art image segmentation foundation model SAM for cryo-EM. Through tria… ☆12 · Apr 27, 2025 · Updated 10 months ago
- Official implementation of "Learning the language of protein structures" ☆41 · Jun 20, 2025 · Updated 8 months ago
- SIU: A Million-Scale Structural Small Molecule-Protein Interaction Dataset for Unbiased Bioactivity Prediction ☆17 · Feb 17, 2025 · Updated last year
- DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback ☆21 · May 22, 2025 · Updated 9 months ago
- Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models … ☆267 · Apr 27, 2025 · Updated 10 months ago
- Contrastive learning and pre-trained encoder (CLAPE) for protein-small molecules binding (SMB) sites prediction ☆19 · Aug 22, 2024 · Updated last year
- ☆30 · Nov 23, 2025 · Updated 3 months ago
- Code for the paper "OneProt: Towards Multi-Modal Protein Foundation Models" ☆21 · Oct 31, 2025 · Updated 4 months ago
- Fine-tuning Galactica and Gemma to operate on SMILES. Integrates into a molecular optimization algorithm. ☆36 · Feb 20, 2025 · Updated last year
- Clusters protein chains based on CA distance difference ☆16 · Feb 4, 2025 · Updated last year
- ☆22 · Mar 30, 2024 · Updated last year