sourmash-bio / databases
Build sourmash databases for genbank.
☆12Updated 2 years ago
Alternatives and similar repositories for databases:
Users that are interested in databases are comparing it to the libraries listed below
- benchmarking and containerization of tools for analysis of complex non-clinical metagenomes.☆21Updated 6 years ago
- Indel-aware consensus for aligned BAM☆21Updated last month
- fast, multithreaded sourmash operations: search, compare, and gather.☆22Updated this week
- Experiments with using BIGSI data structure for metagenomic and QC applications☆19Updated 8 months ago
- Given a set of kmers (fasta format) and a set of sequences (fasta format), this tool will extract the sequences containing the kmers.☆21Updated last year
- Alternative taxonomic consensus algorithms based on the NCBI taxonomy tree☆16Updated 11 months ago
- Collection of utilities for working with PacBio-based assemblies☆13Updated 2 years ago
- Find Unique genomic Regions☆29Updated last month
- Filter of Pairwise Alignement☆44Updated 3 years ago
- Extract lineage CSVs from NCBI for use with sourmash lca.☆28Updated last year
- blast, shmlast☆22Updated 4 years ago
- A long read simulator based on badread idea☆22Updated 2 years ago
- a Contig Alignment Tool for Pairwise Assembly Comparison☆13Updated 5 years ago
- software to identify primers that can distinguish genomes☆21Updated 3 months ago
- ☆19Updated 8 years ago
- Accurate, Lightweight Clustering of de novo Transcriptomes using Fragment Equivalence Classes☆31Updated 10 months ago
- Minimizer-based assembly scaffolding and mapping using long reads☆39Updated 6 months ago
- reference free variant assembly☆33Updated last year
- Viral genome coverage evaluation for metagenomic diagnostics☆28Updated 4 months ago
- Rapid competitive read demulitplexer. Made with tries.☆23Updated last year
- Scaffolding with assembly likelihood optimization☆22Updated 4 years ago
- Read contamination removal☆25Updated last year
- A simple tool to fix PacBio fasta/q that was not properly split into subreads☆15Updated 3 years ago
- Symmetric DUST for finding low-complexity regions in DNA sequences☆38Updated last year
- Dividing heterogeneous long-read sequencing into groups with de Bruijn graphs☆21Updated 5 months ago
- Output FASTQ summary statistics in JSON format☆30Updated 2 years ago
- Paint genomes with taxa-specific k-mer probabilities☆8Updated 7 years ago
- SAMsift: advanced filtering and tagging of SAM/BAM alignments using Python expressions.☆23Updated 7 years ago
- Classify sequencing reads using MinHash.☆48Updated 5 years ago
- Haplotype-aware genome assembly toolkit☆29Updated 5 years ago