MrPowers / python-parquet-examples
Using the Parquet file format with Python
☆15Updated last year
Alternatives and similar repositories for python-parquet-examples:
Users that are interested in python-parquet-examples are comparing it to the libraries listed below
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Pandas helper functions☆30Updated 2 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆21Updated last year
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Updated 4 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- ☆14Updated 3 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- Dask integration for Snowflake☆30Updated 4 months ago
- This repository is no longer maintained.☆15Updated 3 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 2 years ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- ☆47Updated last week
- ☆29Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- A CLI to manage and monitor permissions in AWS Lake Formation☆27Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Plugin for Intake to read from SQL servers☆15Updated last year
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆20Updated last year
- A Delta Lake reader for Dask☆49Updated 5 months ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated 11 months ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated last week
- Code accompanying AWS blog post "Build a Semantic Search Engine for Tabular Columns with Transformers and Amazon OpenSearch Service"☆17Updated last year
- A collection of python utility functions☆11Updated 8 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year