circulosmeos / gztool
extract random-positioned data from gzip files with no penalty, including gzip tailing like with 'tail -f'!
☆138 Updated 5 months ago
Alternatives and similar repositories for gztool:
Users who are interested in gztool are comparing it to the libraries listed below.
- Fast parallel random access to bzip2 and gzip files in Python ☆78 Updated last week
- Truly parallel gzip decompression ☆125 Updated 5 years ago
- Open compressed files in Python ☆74 Updated last month
- Parallel Block GZIP ☆50 Updated 8 years ago
- Faster zlib and gzip compatible compression and decompression by providing python bindings for the isa-l library. ☆49 Updated last month
- A multi-threading implement of Python gzip module ☆55 Updated last year
- 📖 🧬 SSHash is a compressed, associative, exact, and weighted dictionary for k-mers. ☆84 Updated last week
- Genomics Extension for SQLite ☆162 Updated 8 months ago
- Augmented Interval Tree implemented in Cython/C ☆20 Updated 3 months ago
- Fast random access of gzip files in Python ☆107 Updated 3 months ago
- Compress a file into a seekable zstd with special handling for .tar archives ☆64 Updated 2 years ago
- COBS - Compact Bit-Sliced Signature Index (for Genomic k-Mer Data or q-Grams) ☆84 Updated last year
- A 'time'-like utility for Unix that measures peak memory usage ☆66 Updated 12 years ago
- Parallel bzip2 utility ☆142 Updated 2 years ago
- The Nested Containment List for Python. Basically a static interval-tree that is silly fast for both construction and lookups. ☆220 Updated 8 months ago
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms ☆30 Updated last month
- Efficient variant-call data storage and retrieval library using the TileDB storage library. ☆93 Updated this week
- Bonsai: Fast, flexible taxonomic analysis and classification ☆70 Updated last year
- cgmemtime measures the high-water RSS+CACHE memory usage of a process and its descendant processes. ☆115 Updated 2 weeks ago
- mirror of GNU Datamash. Send questions/comments/bugs to bug-datamash@gnu.org ☆75 Updated last month
- C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings ☆152 Updated 9 months ago
- Reference server implementation for the GA4GH HTSget API standard. ☆12 Updated last year
- A multi-threading implement of Python gzip module ☆57 Updated last year
- A command line program for large scale buffering between piped programs ☆15 Updated 3 years ago
- A very fast interval tree data structure ☆117 Updated 3 months ago
- A library that mimic fread, fseek and ftell for reading zstd compressed files. ☆21 Updated 2 years ago
- deBGR: An Efficient and Near-Exact Representation of the Weighted de Bruijn Graph ☆30 Updated 4 years ago
- Fast(er) statistics from the command line. ☆93 Updated 3 years ago
- Implicit Interval Tree with Interpolation Index ☆41 Updated 2 years ago
- Performs memory-efficient reservoir sampling on very large input files delimited by newlines ☆69 Updated 5 years ago