schlerp / pelt-studio
Python ELT Studio, an application for building ELT (and ETL) data flows.
☆57 Updated 3 years ago
Alternatives and similar repositories for pelt-studio:
Users who are interested in pelt-studio are comparing it to the libraries listed below.
- manipulate pandas dataframes from the comfort of your browser ☆171 Updated 3 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆34 Updated last week
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆83 Updated this week
- A monorepo of many Rill example projects ☆33 Updated 2 weeks ago
- dagster scikit-learn pipeline example. ☆44 Updated last year
- A FastAPI CLI & Streamlit App wrapper for Excel files... create APIs from Excel data files within seconds ☆70 Updated last year
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser ☆33 Updated last year
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎 ☆16 Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python ☆22 Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆123 Updated 3 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆121 Updated 8 months ago
- A curated list of dagster code snippets for data engineers ☆53 Updated 11 months ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi… ☆42 Updated 2 years ago
- Sample configuration to deploy a modern data platform. ☆87 Updated 3 years ago
- a collection of resources and blogs about Apache Superset ☆81 Updated 3 years ago
- Notebook gallery and issue tracking for Atoti ☆223 Updated this week
- 🏗️ Create APIs from CSV files within seconds, using fastapi ☆77 Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated 10 months ago
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊 ☆32 Updated last year
- ☆65 Updated 6 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆125 Updated 2 weeks ago
- Data Lineage Tracing Library ☆22 Updated 3 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 2 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 Updated 3 years ago
- A batteries included docker build including Streamlit + Visualization Tools + Pandas/Numpy + More ☆11 Updated 3 years ago
- dotML is a light-weight semantic layer written in Python. ☆32 Updated last year
- a convenient way to anonymize your data for analytics ☆20 Updated 3 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku) ☆14 Updated 2 years ago