☆110Mar 7, 2022Updated 4 years ago
Alternatives and similar repositories for meaningful-protein-representations
Users that are interested in meaningful-protein-representations are comparing it to the libraries listed below
Sorting:
- Homology reduced UniProt, train-/valid-/testsets for language modeling☆16Apr 20, 2022Updated 3 years ago
- Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different dom…☆734Dec 11, 2022Updated 3 years ago
- a Transformer-based neural network for generating highly optimized protein sequences called Regularized Latent Space Optimization (RELSO)☆89Jul 6, 2023Updated 2 years ago
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding☆151Mar 10, 2025Updated last year
- DistilProtBert implementation, a distilled version of ProtBert model.☆16Sep 21, 2022Updated 3 years ago
- ☆256Jul 31, 2024Updated last year
- Prediction of binding residues for metal ions, nucleic acids, and small molecules.☆36Sep 2, 2025Updated 6 months ago
- Embedding-based annotation transfer (EAT) uses Euclidean distance between vector representations (embeddings) of proteins to transfer ann…☆41Aug 29, 2025Updated 6 months ago
- Learning Protein Constitutive Motifs from Sequence Data: RBM toolbox☆20Jan 30, 2019Updated 7 years ago
- Evolutionary velocity with protein language models☆98Dec 9, 2025Updated 3 months ago
- test☆14Nov 13, 2020Updated 5 years ago
- Python package to manage protein structures and their annotations☆45Feb 22, 2024Updated 2 years ago
- Protein sequence classification with self-supervised pretraining☆82Nov 29, 2021Updated 4 years ago
- Get protein embeddings from protein sequences☆507Apr 28, 2023Updated 2 years ago
- ☆76Oct 27, 2021Updated 4 years ago
- Deciphering protein evolution and fitness landscapes with latent space models☆37Nov 2, 2021Updated 4 years ago
- This repository is for the paper "A generative nonparametric Bayesian model for whole genomes"☆14Jun 7, 2023Updated 2 years ago
- A generative latent variable model for biological sequence families.☆249Mar 15, 2022Updated 4 years ago
- Web cards/apps describing peptides☆30Apr 26, 2023Updated 2 years ago
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models☆12Dec 20, 2022Updated 3 years ago
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a…☆1,293May 22, 2025Updated 9 months ago
- A collection of *fold* tools☆302Aug 8, 2025Updated 7 months ago
- Efficient evolution from protein language models☆221Aug 26, 2023Updated 2 years ago
- Inference code for PoET: A generative model of protein families as sequences-of-sequences☆93Apr 24, 2024Updated last year
- Listing of papers about machine learning for proteins.☆1,690May 31, 2024Updated last year
- open source repository☆146Nov 30, 2023Updated 2 years ago
- UniRep model, usage, and examples.☆361Mar 1, 2026Updated 2 weeks ago
- Modelling the Language of Life - Deep Learning Protein Sequences☆76Dec 28, 2020Updated 5 years ago
- RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Ma…☆98Jan 24, 2023Updated 3 years ago
- ☆77Aug 23, 2023Updated 2 years ago
- Official code repository of "BERTology Meets Biology: Interpreting Attention in Protein Language Models."☆305May 1, 2025Updated 10 months ago
- ESM-GearNet for Protein Structure Representation Learning (https://arxiv.org/abs/2303.06275)☆111Oct 23, 2023Updated 2 years ago
- Meta learning addresses noisy and under-labeled data in machine learning-guided antibody engineering (https://doi.org/10.1016/j.cels.2023…☆22Aug 8, 2024Updated last year
- Multi-task and masked language model-based protein sequence embedding models.☆106Jun 16, 2021Updated 4 years ago
- Graph neural network for generating novel amino acid sequences that fold into proteins with predetermined topologies.☆60Mar 24, 2021Updated 4 years ago
- ☆19Oct 8, 2020Updated 5 years ago
- CLANS_2 is a Python-based program for clustering sequences in the 2D or 3D space, based on their sequence similarities. CLANS visualizes …☆26Dec 5, 2024Updated last year
- ASAP-SML: An Antibody Sequence Analysis Pipeline Using Statistical Testing and Machine Learning☆11Jul 6, 2023Updated 2 years ago
- Official repository for the paper "Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval"☆164Aug 24, 2023Updated 2 years ago