RealImpactAnalytics / trumania
Trumania is a scenario-based random dataset generator library in python 3
☆112Updated 3 years ago
Alternatives and similar repositories for trumania:
Users that are interested in trumania are comparing it to the libraries listed below
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- Predict whether or not a patient will show up to their next appointment using automated feature engineering☆29Updated 4 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi…☆32Updated 5 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- Dockerfiles for images used as part of the Orbyter toolset☆44Updated 11 months ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- ☆110Updated 3 months ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 6 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Accelerate data science☆116Updated 3 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- A luigi powered analytics / warehouse stack☆88Updated 8 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- A web frontend for scheduling Jupyter notebook reports☆252Updated 4 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- 📝 A blog post about report generation and automation in python☆40Updated 5 years ago
- ☆21Updated 7 months ago
- Summarise and explore Pandas DataFrames☆98Updated 4 years ago
- python library for automated dataset normalization☆114Updated last year
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated last year
- Python package for Bayesian Tests / AB Testing☆40Updated 4 years ago
- MLflow App Library☆78Updated 6 years ago
- A simple example of python api for real time machine learning, using scikit-learn, Flask and Docker☆134Updated last year