sourmash-bio / databases
Build sourmash databases for genbank.
☆12Updated last year
Alternatives and similar repositories for databases:
Users that are interested in databases are comparing it to the libraries listed below
- ☆19Updated 8 years ago
- fast, multithreaded sourmash operations: search, compare, and gather.☆22Updated this week
- benchmarking and containerization of tools for analysis of complex non-clinical metagenomes.☆21Updated 6 years ago
- Alternative taxonomic consensus algorithms based on the NCBI taxonomy tree☆14Updated 9 months ago
- ☆23Updated 5 years ago
- Find Unique genomic Regions☆29Updated 2 months ago
- Given a set of kmers (fasta format) and a set of sequences (fasta format), this tool will extract the sequences containing the kmers.☆21Updated last year
- Classify sequencing reads using MinHash.☆48Updated 4 years ago
- Paint genomes with taxa-specific k-mer probabilities☆8Updated 7 years ago
- Read contamination removal☆24Updated last year
- A long read simulator based on badread idea☆22Updated 2 years ago
- software to identify primers that can distinguish genomes☆21Updated 2 months ago
- Output FASTQ summary statistics in JSON format☆29Updated 2 years ago
- a Contig Alignment Tool for Pairwise Assembly Comparison☆13Updated 5 years ago
- Collection of utilities for working with PacBio-based assemblies☆13Updated last year
- blast, shmlast☆22Updated 4 years ago
- Variant call verification☆16Updated 2 months ago
- Variant call adjudication☆16Updated 9 months ago
- Iterate over minimizers of a DNA sequence☆28Updated 8 months ago
- Calculate genome wide average nucleotide identity (gwANI) for a multiFASTA alignment☆16Updated 6 years ago
- Rapid competitive read demulitplexer. Made with tries.☆24Updated last year
- Minimizer-based assembly scaffolding and mapping using long reads☆37Updated 5 months ago
- A method of assessing sequence complexity based on kmer frequencies☆31Updated 6 years ago
- Symmetric DUST for finding low-complexity regions in DNA sequences☆36Updated last year
- Strain-level abundances estimation in metagenomic samples using variation graphs☆25Updated 2 years ago
- Reference-free Binning of Metagenomics Long Reads using Coverage and Composition☆20Updated 6 months ago
- Experiments with using BIGSI data structure for metagenomic and QC applications☆19Updated 7 months ago
- Ampseer examines reads in fastq format and identifies which multiplex PCR primer set was used to generate the SARS-CoV-2 sequencing libra…☆14Updated last year
- PyO3 bindings and Python interface to skani, a method for fast genomic identity calculation using sparse chaining.☆27Updated last week
- Strain resolved metagenome simulator☆13Updated 6 years ago