patterns-app / patterns-devkit
Data pipelines from re-usable components
☆106 Updated last year
Related projects
Alternatives and complementary repositories for patterns-devkit
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 Updated 8 months ago
- A Python library bake-off for medium-sized datasets ☆24 Updated last year
- Codd method-chained SQL generator and Pandas data processing in Python. ☆115 Updated last year
- Type System for Data Analysis in Python ☆207 Updated 3 months ago
- Utilities for creating ETL pipelines with mara ☆36 Updated 2 years ago
- ☆21 Updated 2 months ago
- Universal data copy ☆9 Updated 2 years ago
- Data Catalog for Databases and Data Warehouses ☆31 Updated 9 months ago
- ☆115 Updated last year
- Helper code to interact with Rasgo via our SDK, PyRasgo ☆40 Updated last year
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆140 Updated last year
- Visualize Airflow's schedule by exporting future DAG runs as events to Google Calendar. ☆70 Updated last year
- A serverless DuckDB deployment on GCP ☆35 Updated 2 years ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 Updated 3 years ago
- afctl helps to manage and deploy Apache Airflow projects faster and smoother. ☆130 Updated 2 years ago
- ☆82 Updated 6 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆122 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆37 Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆61 Updated last year
- Arrow, pydantic style ☆82 Updated last year
- ☆76 Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆77 Updated this week
- sgr (command line client for Splitgraph) and the splitgraph Python library ☆324 Updated 6 months ago
- Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with function… ☆91 Updated 2 years ago
- A playground for running DuckDB as a stateless query engine over a data lake ☆168 Updated 9 months ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆121 Updated 5 months ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆64 Updated this week
- The stupidest database of all time. ☆55 Updated this week