nickmancol / python_data_pipeline
A Simple Pure Python Data Pipeline to process a Data Stream
☆9 · Updated 4 years ago
Alternatives and similar repositories for python_data_pipeline:
Users who are interested in python_data_pipeline are comparing it to the repositories listed below:
- 🔍 Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎 ☆16 · Updated 2 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs ☆20 · Updated 3 years ago
- Profiles the data, validates the schema, runs data quality checks, and produces a report ☆20 · Updated 5 years ago
- Skeleton project for Apache Airflow training participants to work on. ☆16 · Updated 4 years ago
- CSS & HTML on Python Easily ☆11 · Updated 4 months ago
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering ☆32 · Updated 3 years ago
- A collection of Python utility functions ☆11 · Updated 7 months ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Updated 11 months ago
- Simple samples for writing ETL transform scripts in Python ☆22 · Updated 3 years ago
- Function decorators for Pandas DataFrame column name and data type validation ☆16 · Updated last week
- Open source bits of athenian-api. ☆19 · Updated last year
- Python utility to extract differences between two pandas dataframes. ☆11 · Updated 7 months ago
- A small Python module containing quick utility functions for standard ETL processes. ☆34 · Updated this week
- A browser-based Parquet file viewer ☆43 · Updated 2 weeks ago
- Getting started with DuckDB, by Packt Publishing ☆51 · Updated 6 months ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and … ☆82 · Updated 9 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 · Updated last year
- Dash Component created from ukrbublik/react-awesome-query-builder ☆12 · Updated last week
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆57 · Updated 3 years ago
- Comparing Polars to Pandas and a small introduction ☆43 · Updated 3 years ago
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs << ☆14 · Updated last year
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema. ☆29 · Updated 2 months ago