Arcadia-Science / 2023-nr-clustering
Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.
☆23Updated last year
Alternatives and similar repositories for 2023-nr-clustering:
Users that are interested in 2023-nr-clustering are comparing it to the libraries listed below
- Universal and efficient core gene phylogeny with Foldseek and ProstT5☆45Updated this week
- Improved Inference of Ortholog Groups using Hidden Markov Models☆28Updated 3 weeks ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆25Updated last year
- ☆37Updated last month
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆23Updated 9 months ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆15Updated 5 months ago
- Nail is an Alignment Inference tooL☆43Updated this week
- scripts for predicting natural product activity from biosynthetic gene cluster sequences☆23Updated last year
- a python package for automated generation of phylogenetic trees from genbank files☆16Updated 2 weeks ago
- lsaBGC - Lineage Specific Analysis of Biosynthetic Gene Clusters☆37Updated 5 months ago
- Rapid discovery of novel prophages using biological feature engineering and machine learning☆36Updated 3 weeks ago
- Pipeline for major biological analyses.☆34Updated 2 years ago
- Bacterial Annotation by Learned Representation of Genes☆54Updated 3 years ago
- ☆17Updated 2 years ago
- VirID: An integrated platform for the discovery and characterization of RNA Viruses☆18Updated 2 months ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.☆16Updated this week
- A workflow and scripts for large-scale antiSMASH analyses☆33Updated 3 weeks ago
- BiG-MEx implementation as Docker images and R packages☆12Updated 2 years ago
- Discovery of conserved gene clusters in multiple genomes☆57Updated this week
- ☆19Updated 2 years ago
- uorf4u is a bioinformatics tool for conserved upstream ORF annotation.☆13Updated last year
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆40Updated 2 weeks ago
- Code for detecting genomic island insertions in clades of microbes.☆19Updated last year
- Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins☆12Updated 3 years ago
- Nanopore UMI-linked consensus sequencing☆12Updated 3 years ago
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 3 years ago
- ☆30Updated 3 years ago
- The gview wiki migrated to GitHub.☆10Updated 4 years ago
- Bioinformatic Tools for study Evolution of metabolic diversity☆31Updated 10 months ago
- Eukfinder: A pipeline to retrieve microbial eukaryote genomes from metagenomic sequencing data☆16Updated this week