MrPowers / python-parquet-examplesLinks
Using the Parquet file format with Python
☆15Updated last year
Alternatives and similar repositories for python-parquet-examples
Users that are interested in python-parquet-examples are comparing it to the libraries listed below
Sorting:
- Pandas helper functions☆31Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- ☆29Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- A collection of python utility functions☆11Updated 11 months ago
- Big Data Demystified meetup and blog examples☆31Updated 10 months ago
- Dask integration for Snowflake☆30Updated 7 months ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 3 months ago
- Dask on ECS Fargate☆14Updated 5 years ago
- Prefect 2 flows☆11Updated 6 months ago
- IceRunner is an Apache Arrow Flight Server Implementation for Apache Iceberg Tables☆9Updated 2 months ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- ☆11Updated 6 months ago
- ☆11Updated 7 months ago
- Fully unit tested utility functions for data engineering. Python 3 only.☆17Updated 10 months ago
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated last week
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆15Updated 4 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 5 months ago
- A serverless duckDB deployment at GCP☆39Updated 2 years ago
- Customizable GitOps template for Kubeflow on AWS EKS☆10Updated 4 years ago
- Repository containing various utils related to Snowflake migration at Faire.☆12Updated 2 years ago
- Styles for dbt on the net☆10Updated 6 months ago
- Prefect integrations for working with OpenAI.☆34Updated last year