Arcadia-Science / 2023-nr-clustering
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated last year
Alternatives and similar repositories for 2023-nr-clustering:
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
- Universal and efficient core gene phylogeny with Foldseek and ProstT5☆55Updated this week
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆27Updated this week
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- a python package for automated generation of phylogenetic trees from genbank files☆21Updated last month
- Pipeline for major biological analyses.☆34Updated 3 years ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes)☆41Updated 3 months ago
- ☆38Updated this week
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters☆37Updated last week
- ☆31Updated 3 years ago
- Simple phylogenetic tree visualization python package for phylogenetic analysis☆48Updated 7 months ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆20Updated 3 weeks ago
- Bacterial Annotation by Learned Representation of Genes☆55Updated 4 years ago
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆48Updated 2 weeks ago
- Improved Inference of Ortholog Groups using Hidden Markov Models☆33Updated last month
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n…☆20Updated 9 months ago
- Nail is an Alignment Inference tooL☆45Updated last week
- A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers☆47Updated 2 months ago
- Discovery of conserved gene clusters in multiple genomes☆79Updated 2 weeks ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆17Updated 7 months ago
- Nanopore UMI-linked consensus sequencing☆14Updated 4 years ago
- BiG-MEx implementation as Docker images and R packages☆13Updated 3 years ago
- Rapid discovery of novel prophages using biological feature engineering and machine learning☆37Updated 3 months ago
- Code for detecting genomic island insertions in clades of microbes.☆19Updated last year
- A workflow and scripts for large-scale antiSMASH analyses☆39Updated last week
- Metagenome analysis using the Kraken software suite☆31Updated 2 years ago
- ☆29Updated last month
- A fork of Prodigal meant to improve gene calling for giant viruses and viruses that use alternative genetic codes☆35Updated last year
- ☆17Updated 2 years ago
- alignment-free coverage calculation for metagenomic binning >100 times faster☆40Updated last month
- MacSyFinder models allowing for a systematic search of anti-phage systems☆22Updated 2 months ago