alexpreynolds / sampleLinks
Performs memory-efficient reservoir sampling on very large input files delimited by newlines
☆69Updated 5 years ago
Alternatives and similar repositories for sample
Users that are interested in sample are comparing it to the libraries listed below
Sorting:
- Squeakr: An Exact and Approximate k -mer Counting System☆86Updated 11 months ago
- Fast and memory-efficient sequencing error corrector☆94Updated 3 weeks ago
- utilities for indexing and sequence extraction from FASTA files☆59Updated 4 years ago
- Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index☆85Updated last year
- dynamic-updateable-index☆11Updated 10 years ago
- Enhanced Artificial Genome Engine: next generation sequencing reads simulator☆33Updated 5 years ago
- FM-index representation of a de Bruijn graph☆26Updated 8 years ago
- Streaming algorithm for computing kmer statistics for massive genomics datasets☆54Updated 5 years ago
- Implicit Interval Tree with Interpolation Index☆42Updated 3 years ago
- Incremental construction of FM-index for DNA sequences☆72Updated last year
- Code accompanying the publication for compressed graph annotation☆13Updated 6 years ago
- Parallel Block GZIP☆50Updated 9 years ago
- Smith-Waterman database searches with inter-sequence SIMD parallelisation☆60Updated 2 years ago
- An alignment-free, reference-free and incremental data structure for colored de Bruijn graph with application to pan-genome indexing.☆44Updated 4 years ago
- Software accompanying "HINGE: Long-Read Assembly Achieves Optimal Repeat Resolution"☆63Updated 5 years ago
- BWT-based index for graphs☆73Updated 10 months ago
- Cosmo is a fast, low-memory DNA assembler using a Succinct (variable order) de Bruijn Graph.☆53Updated last year
- de Bruijn CompAction in Low Memory☆23Updated 10 years ago
- ☆73Updated 6 years ago
- A fast constructor of the compressed de Bruijn graph from many genomes☆42Updated 2 months ago
- An Oxford Nanopore Basecaller☆70Updated 4 years ago
- Practical Dynamic de Bruijn Graphs☆18Updated 5 years ago
- a toolset for fast DNA read set matching and assembly using a new type of reduced kmer☆37Updated 4 years ago
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 5 years ago
- normalize, left-align, trim, validate and clean VCF files☆20Updated 10 years ago
- BlastGraph is a new tool for computing intensive approximate pattern matching in a sequence graph or a de-Bruijn graph. Given an oriented…☆12Updated 12 years ago
- An integrated high performance bioinformatics toolkit☆23Updated 6 years ago
- tools for error correction and working with long read data☆44Updated 11 years ago
- MinHash Alignment Process (MHAP, pronounced MAP): locality-sensitive hashing to detect long-read overlaps and utilities☆98Updated 3 years ago
- succinct labeled graphs with collections and paths☆15Updated 7 years ago