sourmash-bio/databases

Build sourmash databases for genbank.

☆11

Alternatives and similar repositories for databases

Users who are interested in databases compare it to the repositories listed below.

dahak-metagenomics / dahak
benchmarking and containerization of tools for analysis of complex non-clinical metagenomes.
☆21 · Sep 21, 2018 · Updated 7 years ago
sourmash-bio / wort
A database for signatures of public genomic sources
☆18 · Jan 1, 2026 · Updated 2 months ago
MetaSeek-Sequencing-Data-Discovery / metaseek
☆19 · Dec 6, 2022 · Updated 3 years ago
dib-lab / genome-grist
map Illumina metagenomes to genomes!
☆38 · Jan 22, 2026 · Updated last month
sourmash-bio / sourmash_plugin_branchwater
fast, multithreaded sourmash operations: search, compare, and gather.
☆25 · Feb 23, 2026 · Updated last week
sckott / pytaxa
taxonomic classes for Python
☆11 · Aug 30, 2021 · Updated 4 years ago
brainstorm / s3-rust-htslib-bam
AWS lambda S3 + rust-htslib: A serverless bioinformatics example
☆14 · Jun 7, 2022 · Updated 3 years ago
dib-lab / pybbhash
A Python wrapper for the bbhash library for Minimal Perfect Hashing
☆19 · Oct 26, 2025 · Updated 4 months ago
dib-lab / kProcessor
kProcessor: kmers processing framework.
☆10 · Oct 1, 2023 · Updated 2 years ago
Singular-Genomics / singular-demux
Singular Genomics Demultiplexing Tool
☆16 · Mar 5, 2024 · Updated last year
St4NNi / jam-rs
Just another minhash implementation.
☆12 · Feb 23, 2026 · Updated last week
dib-lab / 2018-ncbi-lineages
Extract lineage CSVs from NCBI for use with sourmash lca.
☆29 · Oct 4, 2023 · Updated 2 years ago
sourmash-bio / branchwater
Searching large collections of sequencing data with genome-scale queries
☆16 · Feb 5, 2026 · Updated 3 weeks ago
mdshw5 / hamstring
Tools for generating and decoding error-correcting DNA barcodes
☆15 · Feb 15, 2022 · Updated 4 years ago
38 / pyd4
The python binding for D4 format
☆16 · Oct 22, 2021 · Updated 4 years ago
10XGenomics / fastq_set
Tools for working FASTQ files from sequencers (R1/R2/I1/I2)
☆12 · Dec 6, 2024 · Updated last year
hcdenbakker / colorid
Experiments with using BIGSI data structure for metagenomic and QC applications
☆19 · Aug 12, 2024 · Updated last year
CAMI-challenge / CAMITAX
CAMITAX: Taxon labels for microbial genomes
☆31 · Dec 14, 2023 · Updated 2 years ago
luizirber / phd
☆14 · Oct 14, 2020 · Updated 5 years ago
pairwise-alignment / pa-bench
Benchmarking pairwise aligners
☆37 · Feb 8, 2025 · Updated last year
SamStudio8 / hansel
A graph-inspired data structure for determining likely chains of sequences from breadcrumbs of evidence
☆17 · Jun 29, 2021 · Updated 4 years ago
epruesse / ymp
Flexible omics pipeline
☆17 · Oct 16, 2025 · Updated 4 months ago
sourmash-bio / mastiff
☆16 · Jan 21, 2024 · Updated 2 years ago
bede / tictax
Streaming sequence classification with web services ✓📌
☆19 · Dec 8, 2022 · Updated 3 years ago
yesimon / metax_bakeoff_2019
☆17 · Sep 20, 2019 · Updated 6 years ago
arewebioyet / arewebioyet.github.io
Rust in bioinformatics and computational biology
☆19 · Oct 22, 2022 · Updated 3 years ago
phac-nml / refseq_masher
Mash MinHash search your nucleotide sequences against a NCBI RefSeq genomes database
☆43 · Dec 11, 2020 · Updated 5 years ago
onecodex / taxonomy
Python and Rust library for loading, saving, and manipulating taxonomic trees
☆51 · Feb 20, 2026 · Updated last week
spacegraphcats / spacegraphcats
Indexing & querying large assembly graphs -- in space, no one can hear you miao!
☆118 · Jan 29, 2026 · Updated last month
mkpython3 / Mutation-Simulator
A tool for simulating random mutations in any genome
☆43 · Feb 7, 2024 · Updated 2 years ago
pirovc / multitax
A Python package for obtaining, parsing and exploring biological taxonomies (GTDB, NCBI, Silva, Greengenes, OTT)
☆47 · Feb 5, 2026 · Updated 3 weeks ago
crimBubble / ECCsplorer
The ECCsplorer is a bioinformatics pipeline for the automated detection of extrachromosomal circular DNA (eccDNA) from paired-end read da…
☆20 · Apr 15, 2024 · Updated last year
natir / rustyread
A long read simulator based on badread idea
☆22 · Oct 7, 2022 · Updated 3 years ago
oxli-bio / oxli
k-mers and the like
☆22 · Feb 23, 2026 · Updated last week
alshai / r-index
An optimal space run-length Burrows-Wheeler transform full-text index
☆27 · Oct 28, 2021 · Updated 4 years ago
mw55309 / ideel
Indels are not ideal - quick test for interrupted ORFs in bacterial/microbial genomes
☆54 · May 16, 2023 · Updated 2 years ago
seqan / raptor
A fast and space-efficient pre-filter for querying very large collections of nucleotide sequences.
☆53 · Updated this week
dib-lab / charcoal
Remove contaminated contigs from genomes using k-mers and taxonomies.
☆60 · Nov 3, 2023 · Updated 2 years ago
dib-lab / dammit
just annotate it, dammit!
☆93 · Aug 22, 2023 · Updated 2 years ago