circulosmeos / gztoolLinks
extract random-positioned data from gzip files with no penalty, including gzip tailing like with 'tail -f' !
☆149Updated last year
Alternatives and similar repositories for gztool
Users that are interested in gztool are comparing it to the libraries listed below
Sorting:
- Truly parallel gzip decompression☆124Updated 6 years ago
- Fast parallel random access to bzip2 and gzip files in Python☆84Updated last month
- mirror of GNU Datamash. Send questions/comments/bugs to bug-datamash@gnu.org☆77Updated last month
- Genomics Extension for SQLite☆168Updated last year
- COBS - Compact Bit-Sliced Signature Index (for Genomic k-Mer Data or q-Grams)☆88Updated last year
- Log shell-commands and used files. Snapshot executed scripts. Fully automatic.☆206Updated 9 months ago
- Parallel Block GZIP☆50Updated 9 years ago
- A 'time'-like utility for Unix that measures peak memory usage☆69Updated 13 years ago
- Gzip Decompression and Random Access for Modern Multi-Core Machines☆443Updated last month
- A very fast interval tree data structure☆127Updated 11 months ago
- ☆18Updated 8 years ago
- Augmented Interval Tree implemented in Cython/C☆20Updated 11 months ago
- 📖 🧬 SSHash is a compressed, associative, exact, and weighted dictionary for k-mers.☆90Updated 3 weeks ago
- Rust implementation of probminhash, superminhash and hyperloglog sketching algorithms☆31Updated 7 months ago
- A command line program for large scale buffering between piped programs☆16Updated 4 years ago
- Compress a file into a seekable zstd with special handling for .tar archives☆75Updated 3 years ago
- A genomic minhashing implementation in Rust☆101Updated 6 months ago
- Implicit Interval Tree with Interpolation Index☆42Updated 3 years ago
- Parallel bzip2 utility☆155Updated 5 months ago
- Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index☆84Updated last year
- Benchmarking different languages for a simple bioinformatics task (Counting the GC fraction of DNA in a FASTA file)☆57Updated 2 years ago
- Reference server implementation for the GA4GH HTSget API standard.☆12Updated 2 years ago
- cgmemtime measures the high-water RSS+CACHE memory usage of a process and its descendant processes.☆118Updated 8 months ago
- Bonsai: Fast, flexible taxonomic analysis and classification☆70Updated last year
- toolkit for file system virtualisation of random access compressed FASTA, FAI, DICT & TWOBIT files☆22Updated last year
- Performs memory-efficient reservoir sampling on very large input files delimited by newlines☆69Updated 5 years ago
- Open compressed files in Python☆80Updated 2 months ago
- YAML template engine☆41Updated last month
- A cross-platform command-line tool for executing jobs in parallel☆1,069Updated 3 weeks ago
- A multi-threading implement of Python gzip module☆61Updated last week