Arcadia-Science / 2023-nr-clusteringLinks
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated last week
Alternatives and similar repositories for 2023-nr-clustering
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
Sorting:
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆29Updated 7 months ago
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5☆78Updated this week
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts☆33Updated last year
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- Improved Inference of Ortholog Groups using Hidden Markov Models☆37Updated 2 months ago
- Simple phylogenetic tree visualization python package for phylogenetic analysis☆60Updated last year
- Bacterial Annotation by Learned Representation of Genes☆58Updated 4 years ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases☆23Updated 3 years ago
- Learning and Aligning Large Protein Families with support of protein language models.☆28Updated last week
- Python bindings for the TaxonKit library☆41Updated 4 months ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆26Updated 2 weeks ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆36Updated last month
- Nail is an Alignment Inference tooL☆53Updated last month
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 4 years ago
- ☆41Updated 5 months ago
- Pipeline to apply encoded Kmer analysis to protein sequences☆16Updated 2 months ago
- ☆31Updated 10 months ago
- Metagenome analysis using the Kraken software suite☆35Updated 3 years ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes)☆49Updated 10 months ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD!☆27Updated 2 months ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆38Updated 5 months ago
- a python package for automated generation of phylogenetic trees from genbank files☆23Updated last week
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆58Updated 7 months ago
- ☆31Updated 4 years ago
- A command line tool to identify and annotate small proteins in microbial sequencing datasets.☆25Updated 2 years ago
- A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers☆51Updated this week
- Pipeline for major biological analyses.☆35Updated 3 years ago
- Database and pipeline for protein structure-guided annotations of ecologically relevant functions at the metagenome scale.☆24Updated 2 weeks ago
- Bioinformatic Tools for study Evolution of metabolic diversity☆36Updated last year
- ☆36Updated 4 months ago