patterns-app / patterns-devkit
Data pipelines from re-usable components
☆108 · Updated last year
Alternatives and similar repositories for patterns-devkit:
Users who are interested in patterns-devkit are comparing it to the libraries listed below.
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 · Updated last year
- Type System for Data Analysis in Python ☆210 · Updated 2 weeks ago
- ☆21 · Updated 5 months ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆140 · Updated last year
- Demos of Materialize, the operational data warehouse. ☆51 · Updated 5 months ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 · Updated 3 years ago
- ☆86 · Updated 9 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆62 · Updated 2 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 · Updated last week
- Codd method-chained SQL generator and Pandas data processing in Python. ☆117 · Updated last year
- Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with function… ☆90 · Updated 3 years ago
- A Python library bakeoff for medium-sized datasets ☆24 · Updated last year
- Python binding for DataFusion ☆59 · Updated 2 years ago
- A playground for running DuckDB as a stateless query engine over a data lake ☆184 · Updated last year
- Data Catalog for Databases and Data Warehouses ☆32 · Updated last year
- Visualize Airflow's schedule by exporting future DAG runs as events to Google Calendar. ☆70 · Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆123 · Updated 3 years ago
- GraphQL service for Arrow tables and Parquet data sets. ☆88 · Updated 3 weeks ago
- Ibis analytics, with Ibis (and more!) ☆20 · Updated 4 months ago
- ☆82 · Updated last year
- Arrow, pydantic style ☆84 · Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆189 · Updated this week
- Universal data copy ☆9 · Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆112 · Updated 10 months ago
- Utilities for creating ETL pipelines with mara ☆36 · Updated 2 years ago
- Provides an easy way with Python to protect your data sources by searching their metadata. 🛡️ ☆16 · Updated 2 weeks ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated last year
- Parse dbt artifacts and search dbt models with Algolia ☆52 · Updated 3 years ago
- Ibis Substrait Compiler ☆98 · Updated this week