Arcadia-Science / 2023-nr-clusteringLinks
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated 2 years ago
Alternatives and similar repositories for 2023-nr-clustering
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
Sorting:
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆29Updated 6 months ago
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5☆77Updated 4 months ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases☆23Updated 2 years ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- Improved Inference of Ortholog Groups using Hidden Markov Models☆37Updated last month
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts☆32Updated last year
- ☆41Updated 4 months ago
- Simple phylogenetic tree visualization python package for phylogenetic analysis☆58Updated last year
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆57Updated 7 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆36Updated 2 weeks ago
- Bacterial Annotation by Learned Representation of Genes☆58Updated 4 years ago
- Generation and updating of protein families☆21Updated last week
- Learning and Aligning Large Protein Families with support of protein language models.☆28Updated last week
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 4 years ago
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters☆38Updated last month
- ☆31Updated 4 years ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆20Updated 2 months ago
- Python bindings for the TaxonKit library☆40Updated 4 months ago
- Database and pipeline for protein structure-guided annotations of ecologically relevant functions at the metagenome scale.☆23Updated last month
- Pipeline for major biological analyses.☆35Updated 3 years ago
- ☆19Updated 2 years ago
- Nanopore UMI-linked consensus sequencing☆16Updated 4 years ago
- Machine learning for accurate identification and classification of CRISPR-Cas systems☆23Updated 10 months ago
- a python package for automated generation of phylogenetic trees from genbank files☆23Updated 7 months ago
- BiG-MEx implementation as Docker images and R packages☆13Updated 3 years ago
- Workflow to rapidly quantify taxa from all domains of life, directly from short-read human gut metagenomes☆64Updated last month
- Software for predicting translation initiation rates in bacteria☆29Updated 2 months ago
- A Python package for obtaining complete lineages and the lowest common ancestor (LCA) from a set of taxonomic identifiers☆50Updated 8 months ago
- Scalable Maximum Likelihood Estimation of Phylogenetic Models☆17Updated 7 months ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆22Updated 7 months ago