Convert JSON files to Parquet using PyArrow
☆99Jan 10, 2024Updated 2 years ago
Alternatives and similar repositories for json2parquet
Users that are interested in json2parquet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 7 months ago
- Write Promethues metrics to Parquet files for long-term storage and querying☆10Oct 5, 2020Updated 5 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Deploy dask on YARN clusters☆69Aug 10, 2024Updated last year
- Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.☆117Dec 26, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Nov 16, 2021Updated 4 years ago
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- Query system statistics with SQL.☆17Jun 9, 2023Updated 2 years ago
- ☆10Jun 13, 2018Updated 7 years ago
- A CLI and library to run Singer Taps and Targets☆36Apr 13, 2026Updated 2 weeks ago
- AWS Lambda function used to scrape rental data from Craigslist.☆11Dec 8, 2022Updated 3 years ago
- ☆23Apr 13, 2019Updated 7 years ago
- ShRAGa is a lightweight framework for building RAG applications, created by BigData Boutique☆12Feb 22, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena.☆493Apr 14, 2026Updated 2 weeks ago
- Flask app to calculate compensation of a data scientist☆12Dec 27, 2022Updated 3 years ago
- ☆10Dec 15, 2018Updated 7 years ago
- CLI tool for working with EC2☆11Oct 5, 2025Updated 6 months ago
- Spark batch converter to convert AWS S3 server side logs to Parquet file format☆11Mar 24, 2023Updated 3 years ago
- AWS MWAA Quick Start With Terraform.☆20Aug 19, 2021Updated 4 years ago
- Rate-limiter module which leverages DynamoDB to enforce resource limits☆11Oct 11, 2023Updated 2 years ago
- A Python framework for text mining. Specially Twitter text.☆13Dec 12, 2017Updated 8 years ago
- Wizard's Castle as a vehicle for learning Rust☆18Mar 31, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Showcase on synchronous and asynchronous event processing using AWS SNS, AWS SQS and AWS Lambda.☆11Jun 28, 2018Updated 7 years ago
- App para geração e envio de certificados para eventos☆18Jan 6, 2023Updated 3 years ago
- A python client library for the Stitch Import API☆43Jan 5, 2024Updated 2 years ago
- Strongly opinionated python project management.☆39Dec 9, 2022Updated 3 years ago
- ☆29Sep 30, 2020Updated 5 years ago
- Getting some f# into google cloud functions☆10Feb 9, 2017Updated 9 years ago
- dbt adapter for Athena☆38May 28, 2024Updated last year
- Logging, with pretty coloured squares all over the place.☆18Dec 8, 2022Updated 3 years ago
- A work around implementation to allow webhooks to be called from inside Snowflake. This allows powerful bidirectional integration between…☆15Oct 4, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Javascript library to do Event Tracking using Google Analytics or Piwik☆10Mar 15, 2018Updated 8 years ago
- Modification of the excellent posenet_python to run with the gstreaming components present on the NVIDIA Jetson Nano.☆11May 13, 2020Updated 5 years ago
- OpenTracing instrumentation for Requests☆11Mar 15, 2025Updated last year
- Introduction to F#☆10Jul 22, 2019Updated 6 years ago
- Ultimate LED panel. 8x8 matrix. APA102 RGB LEDs☆12Mar 12, 2024Updated 2 years ago
- A collection of python utility functions☆11Apr 21, 2026Updated last week
- A Starter project for Serverless services with NodeJS 6.10 and Serverless Framework☆12Mar 4, 2018Updated 8 years ago