Multi-task and masked language model-based protein sequence embedding models.
☆106Jun 16, 2021Updated 4 years ago
Alternatives and similar repositories for prose
Users that are interested in prose are comparing it to the libraries listed below
Sorting:
- Source code for "Learning protein sequence embeddings using information from structure" - ICLR 2019☆262Jun 16, 2021Updated 4 years ago
- Get protein embeddings from protein sequences☆507Apr 28, 2023Updated 2 years ago
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a…☆1,293May 22, 2025Updated 9 months ago
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a…☆11Jun 2, 2022Updated 3 years ago
- Repository for publicly available deep learning models developed in Rosetta community☆123Sep 18, 2021Updated 4 years ago
- Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different dom…☆734Dec 11, 2022Updated 3 years ago
- Codebase for our preprint using trRosetta to design proteins with discontinuous functional sites, found here: https://www.biorxiv.org/con…☆16Oct 27, 2021Updated 4 years ago
- Official code repository of "BERTology Meets Biology: Interpreting Attention in Protein Language Models."☆305May 1, 2025Updated 10 months ago
- Evolutionary velocity with protein language models☆98Dec 9, 2025Updated 3 months ago
- Implementation of Protein Classification based on subcellular localization using ProtBert(Rostlab/prot_bert_bfd_localization) model from …☆42May 3, 2024Updated last year
- A collection of tasks to probe the effectiveness of protein sequence representations in modeling aspects of protein design☆117Mar 3, 2026Updated 2 weeks ago
- Public release of Ptolemy package for automated targeting of Cryo-EM grids☆18Mar 9, 2025Updated last year
- Embedding-based annotation transfer (EAT) uses Euclidean distance between vector representations (embeddings) of proteins to transfer ann…☆41Aug 29, 2025Updated 6 months ago
- DistilProtBert implementation, a distilled version of ProtBert model.☆16Sep 21, 2022Updated 3 years ago
- A compilation of deep learning methods for protein design☆97Nov 5, 2022Updated 3 years ago
- ☆110Mar 7, 2022Updated 4 years ago
- Prediction of binding residues for metal ions, nucleic acids, and small molecules.☆36Sep 2, 2025Updated 6 months ago
- An all-atom protein structure dataset for machine learning.☆360Mar 16, 2024Updated 2 years ago
- Simple python interface for the OpenProtein.AI REST API.☆15Updated this week
- My work on building a deep neural network for fast and accurate protein protein interaction prediction☆11Mar 13, 2024Updated 2 years ago
- Unsupervised neural network for learning embeddings of GO terms.☆21Feb 19, 2022Updated 4 years ago
- Homology reduced UniProt, train-/valid-/testsets for language modeling☆16Apr 20, 2022Updated 3 years ago
- Web application for Rxivist, the site that makes it easier to find the most talked-about papers on bioRxiv.org☆10Mar 1, 2023Updated 3 years ago
- Geometric Vector Perceptron --- a rotation-equivariant GNN for learning from biomolecular structure☆164May 1, 2021Updated 4 years ago
- Language modeling of viral evolution☆151Mar 24, 2023Updated 2 years ago
- RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Ma…☆98Jan 24, 2023Updated 3 years ago
- Official repository for the paper "Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval"☆164Aug 24, 2023Updated 2 years ago
- ☆256Jul 31, 2024Updated last year
- Primary RNA sequence model☆43May 20, 2024Updated last year
- Multi-study integration of cellular trajectories☆19Jun 1, 2020Updated 5 years ago
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning☆15Feb 19, 2026Updated last month
- Code for the antibody deep learning paper.☆16Feb 28, 2022Updated 4 years ago
- repDNA is a Python package to generate various features of DNA sequences incorporating physicochemical properties and sequence-order effe…☆13Jul 16, 2022Updated 3 years ago
- open source repository☆146Nov 30, 2023Updated 2 years ago
- Interpretation by Deep Generative Masking for Biological Sequences☆37Dec 9, 2021Updated 4 years ago
- Official Pytorch implementation of PLUS (Protein sequence representations Learned Using Structural information), IEEE Access 2021☆39Sep 5, 2023Updated 2 years ago
- ☆135Jun 3, 2025Updated 9 months ago
- ☆192Feb 8, 2022Updated 4 years ago
- DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction☆35Jul 28, 2021Updated 4 years ago