aivanzhang / panda_patrol
☆23Updated last year
Alternatives and similar repositories for panda_patrol
Users who are interested in panda_patrol are comparing it to the libraries listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆117Updated 2 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆44Updated this week
- Write your dbt models using Ibis☆71Updated 7 months ago
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- ☆90Updated last year
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 4 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆89Updated 8 months ago
- Linear regression in SQL using dbt☆75Updated 9 months ago
- Dagster scikit-learn pipeline example.☆45Updated 2 years ago
- Playground for using large language models in the Modern Data Stack for entity matching☆108Updated 2 years ago
- 🏁 A sweet and speedy code generator for dbt 🏎️✨☆30Updated 3 months ago
- Assessing whether data from a database complies with reference information.☆43Updated last week
- Dask integration for Snowflake☆30Updated 2 months ago
- A simple and easy-to-use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Repo for orienting dbt users to the Dagster asset framework☆55Updated 3 years ago
- Cost Efficient Data Pipelines with DuckDB☆57Updated 5 months ago
- A playground for running DuckDB as a stateless query engine over a data lake☆211Updated last year
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆26Updated last year
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Palm CLI - the tool-belt for data teams☆47Updated last year
- A serverless DuckDB deployment on GCP☆41Updated 3 years ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆106Updated 3 weeks ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- A Python library bakeoff for medium-sized datasets☆24Updated 2 years ago
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 3 years ago
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow.☆26Updated last year
- ✨ Build dashboards with end-to-end version control. 🔋 CLI w/ batteries included, no infra required. Develop on your laptop for instant r…☆78Updated this week