chhantyal / parquet-cli
Command line (CLI) tool to inspect Apache Parquet files on the go
☆198 Nov 9, 2023 Updated 2 years ago
Alternatives and similar repositories for parquet-cli
Users who are interested in parquet-cli are comparing it to the libraries listed below.
- easy install parquet-tools ☆184 Jul 9, 2024 Updated last year
- Parquet Command-line Tools ☆19 Oct 26, 2016 Updated 9 years ago
- ☆14 Dec 8, 2022 Updated 3 years ago
- A docker container designed for kubernetes, forwarding logs to AWS S3 ☆23 May 12, 2021 Updated 4 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform ☆39 Feb 23, 2022 Updated 3 years ago
- quickly ssh into gcloud instances ☆17 May 29, 2021 Updated 4 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Sep 9, 2015 Updated 10 years ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 Jan 11, 2019 Updated 7 years ago
- Marshmallow Schema generator for Pandas DataFrames ☆24 Aug 11, 2020 Updated 5 years ago
- PouchDB + Android = deliciously synchronous (DEPRECATED! DO NOT USE!) ☆29 Nov 15, 2016 Updated 9 years ago
- DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs. ☆13 Jul 4, 2022 Updated 3 years ago
- A decorator that sends alert when a Prefect flow fails ☆15 Apr 5, 2023 Updated 2 years ago
- spark-emr ☆15 Apr 17, 2014 Updated 11 years ago
- pip-installable SQLite extensions ☆15 Feb 23, 2023 Updated 2 years ago
- A trivial wrapper around spf13/cobra to simplify some basic patterns ☆21 Oct 23, 2023 Updated 2 years ago
- ☆16 Jan 23, 2026 Updated 3 weeks ago
- Running EKS Workers on Spot Instances ☆17 Oct 14, 2019 Updated 6 years ago
- ☆34 Mar 30, 2021 Updated 4 years ago
- AWS Blog post code for running feature-extraction on images using AWS Batch and Cloud Development Kit (CDK). ☆20 Oct 28, 2022 Updated 3 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++ ☆16 Mar 27, 2024 Updated last year
- Interoperability libraries & additional data structures and instances for Scalaz ☆54 May 4, 2015 Updated 10 years ago
- BigQuery Schema Conversion Tool ☆23 Oct 6, 2020 Updated 5 years ago
- Paper: A Zero-rename committer for object stores ☆20 Nov 7, 2025 Updated 3 months ago
- python implementation of the parquet columnar file format. ☆883 Jan 6, 2026 Updated last month
- Git Wrapper for Dataset Management ☆15 Jul 20, 2023 Updated 2 years ago
- Filling in the Spark function gaps across APIs ☆50 Apr 14, 2021 Updated 4 years ago
- Spark data profiling utilities ☆22 Nov 24, 2018 Updated 7 years ago
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio. ☆23 Oct 19, 2023 Updated 2 years ago
- Deprecated, please check timoni ☆23 Jan 4, 2024 Updated 2 years ago
- A simple Proof of Concept of a vulnerable web app using a distroless image and Python. ☆25 Mar 21, 2022 Updated 3 years ago
- A tool to test the performance and correctness of kafka mirroring. ☆25 Dec 12, 2019 Updated 6 years ago
- A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support loc… ☆304 Oct 8, 2025 Updated 4 months ago
- A python implementation of the most commonly used variants of the G-test ☆26 Feb 4, 2019 Updated 7 years ago
- ☆23 Aug 27, 2022 Updated 3 years ago
- Use SQL to transform your avro schema/records ☆28 Jan 12, 2018 Updated 8 years ago
- How query engine work golang port for learning purpose ☆23 Dec 25, 2021 Updated 4 years ago
- A multi-platform file-configurable folder comparison tool with html-reporting written in rust ☆12 Updated this week
- PyTorch Flexible Hash Embeddings ☆28 Feb 4, 2020 Updated 6 years ago
- Framework for running macro benchmarks in a clustered environment ☆25 Aug 29, 2022 Updated 3 years ago