A library to mutate parquet files
☆19May 9, 2023Updated 2 years ago
Alternatives and similar repositories for parquet-rewriter
Users that are interested in parquet-rewriter are comparing it to the libraries listed below
Sorting:
- NYT Connections for KOReader☆14Sep 1, 2025Updated 6 months ago
- Python Package to Share/Edit Pandas/Polars DF with web interface!☆11Jun 10, 2025Updated 8 months ago
- A template for wrapping any Java builder (eg., Maven Takari builder) and bring it into Bazel.☆10May 1, 2025Updated 10 months ago
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆17Feb 5, 2026Updated last month
- How to customize Tableau authentication using the AWS Athena's JDBC Credentials Provider capabilites.☆14Jun 8, 2020Updated 5 years ago
- ☆10Apr 11, 2019Updated 6 years ago
- (NO LONGER MAINTAINED) PHP tool for making a replica of Salesforce data☆12Sep 15, 2015Updated 10 years ago
- An in-memory point-in-polygon (reverse geocoding) package for GeoJSON data, principally Who's On First data.☆11Dec 17, 2022Updated 3 years ago
- Protobuf messages in a bottle☆10Feb 14, 2025Updated last year
- A Django-based FHIR server that uses MongoDB for FHIR resource storage☆10Apr 19, 2018Updated 7 years ago
- Provides a `Project` CRD and controller for k8s to help with organising resources☆12Apr 19, 2024Updated last year
- Automates the creation of full-text (sound and text) ebooks in epub/epub3/daisy format, the webserver/client creates smil files to sync a…☆10Nov 12, 2021Updated 4 years ago
- A script to create a docker image from a virtualenv☆10Mar 16, 2016Updated 9 years ago
- An implementation of Racket's Scribble in Clojure☆22Sep 20, 2013Updated 12 years ago
- My personal NixOS configs☆13Updated this week
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- C 结构体与 JSON 快速互转库☆11Nov 27, 2017Updated 8 years ago
- Associated blog post - https://tristanrhodes.com/blog/Adventures-in-Algorithmic-Trading-on-the-Runescape-Grand-Exchange☆10Oct 14, 2024Updated last year
- smbus provides access to the System Management bus over I2C☆15Dec 16, 2020Updated 5 years ago
- Go wrapper around SSH that speaks AWS API☆16Aug 15, 2023Updated 2 years ago
- A Configuration System for Airflow☆16Updated this week
- Pager for tabular data and SQL output☆12Mar 29, 2023Updated 2 years ago
- Library for HTTP request/response workflow☆15Oct 14, 2025Updated 4 months ago
- Documenting various metrics available for open source projects☆14Jan 4, 2024Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- whisk is a data science project framework that makes collaboration, reproducibility, and deployment "just work".☆11Dec 26, 2022Updated 3 years ago
- Clone of https://git.kernel.org/pub/scm/linux/kernel/git/gong.chen/aer-inject.git/ with 32-bit domain support☆12May 30, 2017Updated 8 years ago
- Bazel BUILD files generator for Kotlin/Java projects.☆14Nov 24, 2020Updated 5 years ago
- MCP server for lethain:systems Python library☆14Aug 17, 2025Updated 6 months ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- Apache Arrow Flight example☆11Nov 9, 2020Updated 5 years ago
- Author Hudson CI Plugins in Ruby☆19Aug 11, 2011Updated 14 years ago
- SnowShu is a sampling engine designed to support testing in data development.☆12Aug 26, 2025Updated 6 months ago
- An intelligent predictive text entry platform. Mirror of git://git.code.sf.net/p/presage/presage Please send reports to the SourceForge b…☆11Aug 17, 2015Updated 10 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Deploy an AWS ECS Cluster of EC2 Instances with Terraform☆13Dec 26, 2023Updated 2 years ago
- Python algorithms for regularized regression☆24Sep 7, 2015Updated 10 years ago
- adidas Data Mesh implementation☆12May 13, 2022Updated 3 years ago
- A dbt package to run natural language queries☆10Jan 13, 2023Updated 3 years ago