alexpreynolds / sample
Performs memory-efficient reservoir sampling on very large input files delimited by newlines
☆69 · Updated 4 years ago
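For readers unfamiliar with the technique named in the description, the following is a minimal sketch of textbook reservoir sampling (Algorithm R) over newline-delimited input. It is not taken from this repository and may differ from how the tool works internally; the function name `reservoir_sample` and its parameters `k` and `seed` are hypothetical and chosen only for illustration.

```python
import io
import random
from typing import Iterable, List, Optional


def reservoir_sample(lines: Iterable[str], k: int,
                     seed: Optional[int] = None) -> List[str]:
    """Return up to k lines chosen uniformly at random in one pass.

    Hypothetical helper for illustration; memory use is O(k),
    independent of the total number of input lines.
    """
    rng = random.Random(seed)
    reservoir: List[str] = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k lines.
            reservoir.append(line)
        else:
            # Pick a slot in [0, i]; the new line replaces an existing
            # entry only if the slot falls inside the reservoir, which
            # happens with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = line
    return reservoir


if __name__ == "__main__":
    # Stand-in for a large file: any iterable of lines works,
    # including an open file handle.
    data = io.StringIO("".join(f"line {n}\n" for n in range(1000)))
    for picked in reservoir_sample(data, k=5, seed=42):
        print(picked, end="")
```

Because the input is consumed as a stream, the same function can be handed an open file object, so the file is never loaded into memory in full.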
Related projects:
- utilities for indexing and sequence extraction from FASTA files ☆58 · Updated 3 years ago
- Fast and memory-efficient sequencing error corrector ☆92 · Updated 4 months ago
- Squeakr: An Exact and Approximate k-mer Counting System ☆85 · Updated 7 months ago
- Code accompanying the publication for compressed graph annotation ☆13 · Updated 5 years ago
- Streaming algorithm for computing kmer statistics for massive genomics datasets ☆53 · Updated 4 years ago
- Cosmo is a fast, low-memory DNA assembler using a Succinct (variable order) de Bruijn Graph. ☆51 · Updated 6 months ago
- Enhanced Artificial Genome Engine: next generation sequencing reads simulator ☆32 · Updated 4 years ago
- pythonic wrapper for libhts (moved to: https://github.com/quinlan-lab/hts-python) ☆49 · Updated 7 years ago
- Load numpy arrays and HDF5 files from VCF (variant call format) ☆31 · Updated 7 years ago
- FM-index representation of a de Bruijn graph ☆27 · Updated 7 years ago
- A fast constructor of the compressed de Bruijn graph from many genomes ☆39 · Updated last year
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang ☆47 · Updated 4 years ago
- An alignment-free, reference-free and incremental data structure for colored de Bruijn graph with application to pan-genome indexing. ☆43 · Updated 2 years ago
- Bonsai: Fast, flexible taxonomic analysis and classification ☆70 · Updated 5 months ago
- succinct labeled graphs with collections and paths ☆15 · Updated 5 years ago
- normalize, left-align, trim, validate and clean VCF files ☆20 · Updated 9 years ago
- various tools to download, convert and process the full text of scientific articles ☆52 · Updated last year
- MinHash Alignment Process (MHAP, pronounced MAP): locality-sensitive hashing to detect long-read overlaps and utilities ☆96 · Updated 2 years ago
- Incremental construction of FM-index for DNA sequences ☆67 · Updated 3 months ago
- ☆13 · Updated this week
- Histosketching Using Little Kmers ☆55 · Updated last year
- deBGR: An Efficient and Near-Exact Representation of the Weighted de Bruijn Graph ☆30 · Updated 3 years ago
- Efficient handling of FASTQ files from Python ☆50 · Updated 2 weeks ago
- ☆73 · Updated 5 years ago
- Fast calculations of linkage-disequilibrium in large-scale human cohorts ☆41 · Updated 4 years ago
- SVG based genome viewer written in javascript using D3 ☆33 · Updated 9 years ago
- Software accompanying "HINGE: Long-Read Assembly Achieves Optimal Repeat Resolution" ☆64 · Updated 3 years ago
- A Variant Call Format reader for Python. ☆73 · Updated 9 years ago
- Learn interpretable computational phenotyping models from k-merized genomic data ☆50 · Updated 2 years ago
- Streaming sequence classification with web services ✓📌 ☆19 · Updated last year