stefan-schroedl / tabulator
A set of Unix shell command line tools for quick and convenient batch processing of tabular text files (a.k.a., tab-delimited, tsv, csv, or flat data file format) with a header line. Provides column reference by name, automatic delimiter and compression detection for per-line transformations, sql-like group-by operation and relational join.
☆35Updated 9 years ago
Alternatives and similar repositories for tabulator:
Users that are interested in tabulator are comparing it to the libraries listed below
- mirror of GNU Datamash. Send questions/comments/bugs to bug-datamash@gnu.org☆75Updated last month
- tv(table viewer) for delimited text file(csv,tsv,etc) in terminal.☆44Updated 4 years ago
- Convenience function for quick and dirty data analysis☆68Updated 7 years ago
- Write-once-read-many table for large datasets.☆27Updated last year
- An example of a data analysis pipeline using Make☆17Updated 8 years ago
- Unix terminal histograms and bar charts (hissyfit). Small data/file munge, morph, find and count scripts/functions in various languages, …☆22Updated last week
- AWK and Bash code to easily parse CSV files, with possibly embedded commas and quotes.☆54Updated 7 years ago
- python stuff I use☆19Updated 5 years ago
- The command-line interface to GGD☆42Updated 2 years ago
- a simple read-only sequence database, designed for short reads☆66Updated last year
- Vince Buffalo's devnotes — ½ TIL, ½ notebook☆116Updated 9 years ago
- Python for Command Line Oneliners☆19Updated 10 years ago
- Simple tool to verticalize text delimited files.☆36Updated 11 months ago
- Hail: extract lines from a file, a la `head -n x | tail -n y`☆9Updated 4 years ago
- A light-weight HTML lab notebook generator☆18Updated last year
- Snakemake library for bioinformatics programs, with a focus on next-generation sequencing☆22Updated 9 years ago
- This project contains simple methods to measure sample relatedness and identify potential swaps and contamination☆10Updated 9 years ago
- Unix 'cut' (and 'paste') on steroids: more flexible select columns from files☆68Updated 3 years ago
- A command line tool that uses a Boolean calculus of calendars for computing availability (currently supports Google Calendar API)☆25Updated 3 years ago
- Provides access to complex Bioinformatics software (even BioLinux!) in just one command.☆76Updated 7 years ago
- Abbreviate strings to short, unique identifiers☆24Updated 2 years ago
- Mulled - Automatized Containerized Software Repository☆66Updated last year
- A 'time'-like utility for Unix that measures peak memory usage☆66Updated 12 years ago
- A package for creating and managing sample identifiers in comparative -omics datasets.☆23Updated 8 years ago
- Generate kmers/minimizers/hashes/MinHash signatures, including with multiple kmer sizes.☆24Updated 4 years ago
- Python library that facilitates opening, reading, and writing files (and file-like entities like URLs and streams) agnostic of compressio…☆22Updated last week
- YAML template engine☆31Updated 2 months ago
- ☆13Updated 7 years ago
- Useful FILe and stream Operations☆45Updated 9 years ago
- Parallel Block GZIP☆50Updated 8 years ago