alexpreynolds / sample
Performs memory-efficient reservoir sampling on very large input files delimited by newlines
☆69Updated 5 years ago
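For readers unfamiliar with the technique named in the description, here is a minimal sketch of reservoir sampling over newline-delimited input. It uses the textbook Algorithm R and is not taken from the repository; the function name `reservoir_sample` and its parameters `k` and `seed` are hypothetical names chosen for illustration.

```python
import io
import random
from typing import IO, List, Optional


def reservoir_sample(lines: IO[str], k: int, seed: Optional[int] = None) -> List[str]:
    """Pick up to k lines uniformly at random from a stream in one pass.

    Hypothetical helper illustrating Algorithm R; memory use is O(k)
    regardless of how many lines the input contains.
    """
    rng = random.Random(seed)
    reservoir: List[str] = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k lines.
            reservoir.append(line)
        else:
            # Replace a random slot with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                reservoir[j] = line
    return reservoir


if __name__ == "__main__":
    # Stand-in for a large file: 1000 newline-delimited records.
    data = io.StringIO("".join(f"line{n}\n" for n in range(1000)))
    for picked in reservoir_sample(data, k=5, seed=0):
        print(picked, end="")
```

Because only the reservoir is held in memory, the same loop works on a file handle or `sys.stdin` for inputs far larger than RAM. If the input has fewer than `k` lines, all of them are returned.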
Alternatives and similar repositories for sample
Users that are interested in sample are comparing it to the libraries listed below
- Load numpy arrays and HDF5 files from VCF (variant call format)☆31Updated 8 years ago
- Fast and memory-efficient sequencing error corrector☆93Updated last year
- utilities for indexing and sequence extraction from FASTA files☆59Updated 4 years ago
- Enhanced Artificial Genome Engine: next generation sequencing reads simulator☆33Updated 5 years ago
- Squeakr: An Exact and Approximate k-mer Counting System☆85Updated 5 months ago
- FM-index representation of a de Bruijn graph☆27Updated 8 years ago
- Implicit Interval Tree with Interpolation Index☆41Updated 3 years ago
- Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index☆81Updated last year
- Streaming algorithm for computing kmer statistics for massive genomics datasets☆54Updated 5 years ago
- Parallel Block GZIP☆50Updated 9 years ago
- Code accompanying the publication for compressed graph annotation☆13Updated 6 years ago
- Incremental construction of FM-index for DNA sequences☆71Updated last year
- vgraph is a command line application and Python library to compare genetic variants using variant graphs. ``vgraph`` utilizes a graph re…☆43Updated 3 years ago
- Cosmo is a fast, low-memory DNA assembler using a Succinct (variable order) de Bruijn Graph.☆52Updated last year
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 5 years ago
- Software accompanying "HINGE: Long-Read Assembly Achieves Optimal Repeat Resolution"☆64Updated 4 years ago
- An Oxford Nanopore Basecaller☆71Updated 3 years ago
- An experimental tool to find approximate max-cuts in a large graph☆11Updated 4 years ago
- Lacer: Accurate Base Quality Score Recalibration using Linear Algebra☆8Updated 3 years ago
- Stupid Simple Structural Variant View☆25Updated 8 years ago
- normalize, left-align, trim, validate and clean VCF files☆20Updated 10 years ago
- Flexible genotype query among 30,000+ samples whole-genome☆95Updated 5 years ago
- Fast spliced aligner with low memory requirements☆41Updated 9 years ago
- pythonic wrapper for libhts (moved to: https://github.com/quinlan-lab/hts-python)☆49Updated 8 years ago
- Sparse Project VCF: evolution of VCF to encode population genotype matrices efficiently☆58Updated last year
- ☆21Updated 10 years ago
- Bonsai: Fast, flexible taxonomic analysis and classification☆71Updated last year
- BWT-based index for graphs☆71Updated 4 months ago
- SV detection from paired end reads mapping☆38Updated 15 years ago
- Practical Dynamic de Bruijn Graphs☆18Updated 4 years ago