liudan111 / EvoMIL
Prediction of virus-host association using protein language models and multiple instance learning
☆17Updated 11 months ago
Alternatives and similar repositories for EvoMIL
Users that are interested in EvoMIL are comparing it to the libraries listed below
- GCN classifier for phage taxonomy classification☆31Updated 6 months ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD!☆24Updated this week
- ☆11Updated 2 months ago
- ☆24Updated last year
- MGM (Microbial General Model), a large-scale pretrained language model for interpretable microbiome data analysis.☆21Updated 3 weeks ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆27Updated last month
- Universal and efficient core gene phylogeny with Foldseek and ProstT5☆55Updated last month
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆31Updated 2 months ago
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software.☆21Updated last year
- Protein Sequence Annotation with Language Models☆20Updated 6 months ago
- Host prediction for phages☆25Updated 6 months ago
- Python bindings for the TaxonKit library☆39Updated this week
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆36Updated 8 months ago
- Phage virion protein classifier☆12Updated 6 months ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by NCBI's experimental ClusteredNR database.☆23Updated last year
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n…☆21Updated last week
- DeepECtransformer☆24Updated last year
- ☆13Updated 2 years ago
- Machine learning for accurate identification and classification of CRISPR-Cas systems☆22Updated 4 months ago
- Scripts for predicting natural product activity from biosynthetic gene cluster sequences☆23Updated 2 weeks ago
- Learning and Aligning Large Protein Families with support of protein language models.☆23Updated this week
- A software-suite to perform multiple protein structure alignment and structure feature extraction.☆29Updated last year
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 3 years ago
- A PyTorch implementation of "Automatic Identification and Virtual Directed Evolution of Antimicrobial Peptides with Explainable Deep Lear…☆25Updated 2 months ago
- Workflow to download, process, and explore microbial RNA-seq data from NCBI SRA☆15Updated 6 months ago
- ☆11Updated 5 months ago
- The 3DFI pipeline predicts the 3D structure of proteins and searches for structural homology in the 3D space.☆19Updated last year
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models☆12Updated 2 years ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆20Updated last month