pavelmaksimov / FlowMaster
ETL flow framework based on Yaml configs in Python
☆21Updated 11 months ago
Related projects: ⓘ
- dagster scikit-learn pipeline example.☆43Updated last year
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆40Updated 2 years ago
- Data catalog for everything in your company☆50Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 2 years ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.☆60Updated this week
- Source files for TheiaIDE image suitable for Rockstat platform. Contains number useful tools and dependencies☆12Updated 4 years ago
- PyPI analytics powered by ClickHouse☆39Updated 2 weeks ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆79Updated last week
- Describe business metrics with YAML, query and visualize in Jupyter with zero SQL☆21Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆81Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 2 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python☆11Updated 3 weeks ago
- A curated list of dagster code snippets for data engineers☆48Updated 6 months ago
- OlaPy, an experimental OLAP engine based on Pandas☆106Updated last year
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊☆30Updated last year
- Simple, lightweight, extensible DAG framework for Python with a Kubeflow-like API☆63Updated 6 months ago
- Cloud-agnostic Python API☆59Updated 3 months ago
- Orchestration of data science and earth observation models in Apache Airflow, scale-up with Celery Executor, experiment with jupyter note…☆35Updated 2 years ago
- A Declarative ORM for Redis using Pydantic Models and aioredis☆53Updated last week
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆84Updated last year
- dbt module for myBI connect☆11Updated last year
- Adaptation postgres adapter for Greenplum☆32Updated 6 months ago
- UDF to seamlessly connect ClickHouse to Vertica using external tables☆16Updated 2 years ago
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆12Updated 11 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆36Updated this week
- Utils for fastapi based services.☆34Updated 3 years ago
- Low-code Python library enabling access to APIs, tools, data sources in seconds.☆56Updated last month
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on…☆136Updated 3 weeks ago
- This is where to start the data transformation with dbt and PostgreSQL☆8Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.☆65Updated 3 years ago