Arcadia-Science / 2023-nr-clusteringLinks
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated 2 years ago
Alternatives and similar repositories for 2023-nr-clustering
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
Sorting:
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5☆70Updated last month
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆27Updated 3 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases☆24Updated 2 years ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes)☆44Updated 7 months ago
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆56Updated 4 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆35Updated last month
- Simple phylogenetic tree visualization python package for phylogenetic analysis☆50Updated 10 months ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆17Updated 11 months ago
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters☆37Updated 3 months ago
- ☆39Updated last month
- Bacterial Annotation by Learned Representation of Genes☆57Updated 4 years ago
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 3 years ago
- Pipeline for major biological analyses.☆35Updated 3 years ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆21Updated 4 months ago
- a python package for automated generation of phylogenetic trees from genbank files☆22Updated 4 months ago
- Discovery of conserved gene clusters in multiple genomes☆83Updated 2 months ago
- Code for detecting genomic island insertions in clades of microbes.☆19Updated last year
- Nail is an Alignment Inference tooL☆50Updated this week
- Improved Inference of Ortholog Groups using Hidden Markov Models☆37Updated 5 months ago
- Nanopore UMI-linked consensus sequencing☆16Updated 4 years ago
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts☆30Updated last year
- Bioinformatic Tools for study Evolution of metabolic diversity☆34Updated last year
- ☆31Updated 3 weeks ago
- Database and pipeline for protein structure-guided annotations of ecologically relevant functions at the metagenome scale.☆17Updated 3 weeks ago
- Fast and accurate tool for calculating Average Nucleotide Identity (ANI) and clustering virus genomes and metagenomes☆87Updated 2 months ago
- ☆31Updated 3 years ago
- Python bindings for the TaxonKit library☆41Updated last month
- Rapid discovery of novel prophages using biological feature engineering and machine learning☆37Updated 7 months ago
- Workflow to rapidly quantify taxa from all domains of life, directly from short-read human gut metagenomes☆65Updated last month