Arcadia-Science / 2023-nr-clusteringLinks
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated 2 years ago
Alternatives and similar repositories for 2023-nr-clustering
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
Sorting:
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5☆73Updated 2 months ago
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆28Updated 4 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- Learning and Aligning Large Protein Families with support of protein language models.☆27Updated this week
- Python bindings for the TaxonKit library☆41Updated 2 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆35Updated last week
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆56Updated 5 months ago
- Improved Inference of Ortholog Groups using Hidden Markov Models☆37Updated 5 months ago
- ☆39Updated 2 months ago
- Bacterial Annotation by Learned Representation of Genes☆58Updated 4 years ago
- Discovery of conserved gene clusters in multiple genomes☆83Updated 3 months ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases☆24Updated 2 years ago
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts☆30Updated last year
- Nail is an Alignment Inference tooL☆51Updated last week
- Simple phylogenetic tree visualization python package for phylogenetic analysis☆53Updated 11 months ago
- a python package for automated generation of phylogenetic trees from genbank files☆22Updated 5 months ago
- ☆33Updated last month
- Influenza genome analysis Nextflow workflow☆28Updated 3 weeks ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes)☆46Updated 8 months ago
- Fast and accurate tool for calculating Average Nucleotide Identity (ANI) and clustering virus genomes and metagenomes☆88Updated 3 months ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆21Updated 5 months ago
- Metagenome analysis using the Kraken software suite☆32Updated 2 years ago
- Code for detecting genomic island insertions in clades of microbes.☆19Updated last year
- Workflow to rapidly quantify taxa from all domains of life, directly from short-read human gut metagenomes☆65Updated last month
- PHANOTATE: a gene caller for phages.☆78Updated 8 months ago
- ☆19Updated 2 years ago
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 3 years ago
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters☆38Updated 4 months ago
- A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers☆51Updated 6 months ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆18Updated last week