Build sourmash databases for GenBank.
☆11 May 4, 2023 Updated 2 years ago
Alternatives and similar repositories for databases
Users who are interested in databases compare it to the repositories listed below.
- benchmarking and containerization of tools for analysis of complex non-clinical metagenomes. ☆21 Sep 21, 2018 Updated 7 years ago
- A database for signatures of public genomic sources ☆18 Jan 1, 2026 Updated 2 months ago
- ☆19 Dec 6, 2022 Updated 3 years ago
- map Illumina metagenomes to genomes! ☆38 Jan 22, 2026 Updated last month
- fast, multithreaded sourmash operations: search, compare, and gather. ☆25 Feb 23, 2026 Updated last week
- taxonomic classes for Python ☆11 Aug 30, 2021 Updated 4 years ago
- AWS lambda S3 + rust-htslib: A serverless bioinformatics example ☆14 Jun 7, 2022 Updated 3 years ago
- A Python wrapper for the bbhash library for Minimal Perfect Hashing ☆19 Oct 26, 2025 Updated 4 months ago
- kProcessor: kmers processing framework. ☆10 Oct 1, 2023 Updated 2 years ago
- Singular Genomics Demultiplexing Tool ☆16 Mar 5, 2024 Updated last year
- Just another minhash implementation. ☆12 Feb 23, 2026 Updated last week
- Extract lineage CSVs from NCBI for use with sourmash lca. ☆29 Oct 4, 2023 Updated 2 years ago
- Searching large collections of sequencing data with genome-scale queries ☆16 Feb 5, 2026 Updated 3 weeks ago
- Tools for generating and decoding error-correcting DNA barcodes ☆15 Feb 15, 2022 Updated 4 years ago
- The python binding for D4 format ☆16 Oct 22, 2021 Updated 4 years ago
- Tools for working with FASTQ files from sequencers (R1/R2/I1/I2) ☆12 Dec 6, 2024 Updated last year
- Experiments with using BIGSI data structure for metagenomic and QC applications ☆19 Aug 12, 2024 Updated last year
- CAMITAX: Taxon labels for microbial genomes ☆31 Dec 14, 2023 Updated 2 years ago
- ☆14 Oct 14, 2020 Updated 5 years ago
- Benchmarking pairwise aligners ☆37 Feb 8, 2025 Updated last year
- A graph-inspired data structure for determining likely chains of sequences from breadcrumbs of evidence ☆17 Jun 29, 2021 Updated 4 years ago
- Flexible omics pipeline ☆17 Oct 16, 2025 Updated 4 months ago
- ☆16 Jan 21, 2024 Updated 2 years ago
- Streaming sequence classification with web services ✓📌 ☆19 Dec 8, 2022 Updated 3 years ago
- ☆17 Sep 20, 2019 Updated 6 years ago
- Rust in bioinformatics and computational biology ☆19 Oct 22, 2022 Updated 3 years ago
- Mash MinHash search your nucleotide sequences against an NCBI RefSeq genomes database ☆43 Dec 11, 2020 Updated 5 years ago
- Python and Rust library for loading, saving, and manipulating taxonomic trees ☆51 Feb 20, 2026 Updated last week
- Indexing & querying large assembly graphs -- in space, no one can hear you miao! ☆118 Jan 29, 2026 Updated last month
- A tool for simulating random mutations in any genome ☆43 Feb 7, 2024 Updated 2 years ago
- A Python package for obtaining, parsing and exploring biological taxonomies (GTDB, NCBI, Silva, Greengenes, OTT) ☆47 Feb 5, 2026 Updated 3 weeks ago
- The ECCsplorer is a bioinformatics pipeline for the automated detection of extrachromosomal circular DNA (eccDNA) from paired-end read da… ☆20 Apr 15, 2024 Updated last year
- A long read simulator based on badread idea ☆22 Oct 7, 2022 Updated 3 years ago
- k-mers and the like ☆22 Feb 23, 2026 Updated last week
- An optimal space run-length Burrows-Wheeler transform full-text index ☆27 Oct 28, 2021 Updated 4 years ago
- Indels are not ideal - quick test for interrupted ORFs in bacterial/microbial genomes ☆54 May 16, 2023 Updated 2 years ago
- A fast and space-efficient pre-filter for querying very large collections of nucleotide sequences. ☆53 Updated this week
- Remove contaminated contigs from genomes using k-mers and taxonomies. ☆60 Nov 3, 2023 Updated 2 years ago
- just annotate it, dammit! ☆93 Aug 22, 2023 Updated 2 years ago