Gaius-Augustus / learnMSA
Learning and Aligning Large Protein Families with support of protein language models.
☆20 Updated 3 weeks ago
Alternatives and similar repositories for learnMSA:
Users who are interested in learnMSA are comparing it to the libraries listed below.
- Universal and efficient core gene phylogeny with Foldseek and ProstT5 ☆46 Updated last week
- Improved Inference of Ortholog Groups using Hidden Markov Models ☆30 Updated last month
- Detection of incorrectly labeled sequences across kingdoms ☆81 Updated 2 years ago
- A Python package for automated generation of phylogenetic trees from GenBank files ☆18 Updated last month
- Bacterial Annotation by Learned Representation of Genes ☆54 Updated 4 years ago
- showTree can visualize the phylogeny, protein sequences and protein domains of a gene family in one figure. ☆30 Updated 5 years ago
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆20 Updated 8 months ago
- Rapid discovery of novel prophages using biological feature engineering and machine learning ☆36 Updated last month
- Python tool to reduce size and redundancy of phylogenetic datasets ☆30 Updated last year
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters ☆37 Updated 5 months ago
- Tiberius is a deep learning gene finder. ☆46 Updated this week
- FastOMA is a scalable software package to infer orthology relationships. ☆64 Updated this week
- Simple phylogenetic tree visualization Python package for phylogenetic analysis ☆44 Updated 4 months ago
- SHOOT.bio - the phylogenetic search engine ☆24 Updated last year
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures ☆23 Updated this week
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database. ☆23 Updated last year
- PSAURON is a machine learning model for rapid assessment of protein coding gene annotation ☆28 Updated 2 weeks ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning ☆26 Updated last year
- A workflow and scripts for large-scale antiSMASH analyses ☆34 Updated last month
- ☆28 Updated this week
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes) ☆38 Updated last month
- Orthology assignment using phylogenetic and network analyses ☆46 Updated 3 months ago
- VirID: An integrated platform for the discovery and characterization of RNA Viruses ☆18 Updated 2 months ago
- Bioinformatic tools for studying the evolution of metabolic diversity ☆31 Updated 11 months ago
- Pipeline for major biological analyses. ☆34 Updated 2 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆30 Updated 3 weeks ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal. ☆32 Updated 4 months ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases ☆21 Updated 2 years ago
- Follow up to Grace Blackwell's 661k dataset, for 2023 ☆88 Updated last month
- genEra is a fast and easy-to-use command-line tool that estimates the age of the last common ancestor of protein-coding gene families. ☆48 Updated 2 months ago