Data pipelines from re-usable components
☆107Nov 12, 2025Updated 5 months ago
Alternatives and similar repositories for patterns-devkit
Users that are interested in patterns-devkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- ☆15Apr 4, 2021Updated 5 years ago
- Curiosity based exploration and playing in RL with Gym Robotics envs.☆12Sep 25, 2018Updated 7 years ago
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆814Aug 10, 2025Updated 8 months ago
- Turn an AWS api call into a readable stream☆24May 23, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Data Catalog for Databases and Data Warehouses☆36Jan 15, 2024Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆39Dec 16, 2022Updated 3 years ago
- An interpreted relational query language that compiles to SQL.☆629Aug 17, 2022Updated 3 years ago
- Numeric and scientific computing on GPUs for Python with a NumPy-like API☆93Sep 1, 2021Updated 4 years ago
- An improved Python interface to SQLite☆14Feb 4, 2023Updated 3 years ago
- general-purpose fast, stateless, and deterministic feature extractor written in golang for use in machine learning☆12Mar 17, 2018Updated 8 years ago
- Backend for skillgraph - a skill based framework for building agents that work.☆31Nov 10, 2025Updated 5 months ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆172Feb 10, 2024Updated 2 years ago
- Streaming reactive and dataflow graphs in Python☆460Mar 30, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- A scikit-learn-compatible module for Isolation-based anomaly detection using nearest-neighbor ensembles☆12Aug 30, 2023Updated 2 years ago
- A framework for rapid development of robust data pipelines following a simple design pattern☆27Feb 26, 2024Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Feb 18, 2022Updated 4 years ago
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆14Dec 12, 2025Updated 4 months ago
- Data Mesh Architecture☆85Oct 15, 2025Updated 5 months ago
- Data Lineage Tracing Library☆24Nov 30, 2021Updated 4 years ago
- Simple finite state machines.☆28Jul 7, 2022Updated 3 years ago
- The stupidest database of all time.☆56Mar 27, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines☆67Jan 20, 2023Updated 3 years ago
- Read fixed width data files with Python 3☆14Mar 20, 2026Updated 3 weeks ago
- Linux kernel for SHIELD☆23Mar 12, 2015Updated 11 years ago
- Official dbt adapter for Vertica☆28Jun 13, 2025Updated 10 months ago
- A thread synchonized queue made for PThreads☆11Jan 15, 2021Updated 5 years ago
- Apache Arrow PostgreSQL connector☆62Feb 12, 2024Updated 2 years ago
- Literate is a Clojure & ClojureScript application which you can use to create documents.☆17May 10, 2023Updated 2 years ago
- Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.☆117Dec 26, 2022Updated 3 years ago
- Build a REST API on top of your data warehouse☆42Oct 19, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆303Mar 12, 2026Updated last month
- Quilt is a Scientific Data Management Platform on AWS that helps teams and AI find, trust, and reuse data through deeply versioned, conte…☆1,362Updated this week
- Test all the data☆37Oct 20, 2023Updated 2 years ago
- Redshift Auto Schema is a Python library that takes a delimited flat file or parquet file as input, parses it, and provides a variety of …☆29Jun 21, 2021Updated 4 years ago
- Skeleton project for Apache Airflow training participants to work on.☆17Jul 9, 2020Updated 5 years ago
- Build data pipelines, the easy way 🛠️☆4,136Jun 6, 2023Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.☆65May 15, 2021Updated 4 years ago