MrPowers / python-parquet-examples
Using the Parquet file format with Python
☆15 Updated last year
Alternatives and similar repositories for python-parquet-examples
Users who are interested in python-parquet-examples are comparing it to the repositories listed below
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort. ☆110 Updated 2 years ago
- ☆29 Updated last year
- Styles for dbt on the net ☆10 Updated 7 months ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Library of Prefect tasks and utilities. ☆9 Updated 9 months ago
- Fully unit tested utility functions for data engineering. Python 3 only. ☆17 Updated 10 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- Dask integration for Snowflake ☆30 Updated 7 months ago
- Repository containing various utils related to Snowflake migration at Faire. ☆12 Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆36 Updated last week
- Projects developed by Domino's R&D team ☆78 Updated 3 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎 ☆16 Updated 2 years ago
- 💻 CLI for reporting events to Faros platform ☆14 Updated 2 months ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations) ☆35 Updated 3 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb ☆21 Updated last year
- dbt package for monitoring airflow DAGs and tasks ☆29 Updated 4 months ago
- Example Set up For DBT Cloud using Github Integrations ☆11 Updated 5 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker ☆31 Updated 4 years ago
- Unity Catalog UI ☆40 Updated 10 months ago
- Pandas helper functions ☆31 Updated 2 years ago
- Data Catalog for Databases and Data Warehouses ☆35 Updated last year
- Utilities for creating ETL pipelines with mara ☆36 Updated 3 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface. ☆15 Updated 8 months ago
- Prefect 2 flows ☆11 Updated 7 months ago
- A serverless duckDB deployment at GCP ☆39 Updated 2 years ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated last year
- Demos of Materialize, the operational data warehouse. ☆51 Updated 4 months ago
- Events about the open source data stack ☆13 Updated 3 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypotheses, and visualizing results in a web browser ☆32 Updated 2 years ago