RealImpactAnalytics / trumania
Trumania is a scenario-based random dataset generator library for Python 3
☆112Updated 3 years ago
Alternatives and similar repositories for trumania
Users who are interested in trumania are comparing it to the libraries listed below
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆78Updated last year
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Dockerfiles for images used as part of the Orbyter toolset☆44Updated last year
- Python library for automated dataset normalization☆116Updated last year
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- ☆111Updated 6 months ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports☆253Updated 7 months ago
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 4 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- Type System for Data Analysis in Python☆213Updated 5 months ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆70Updated 2 months ago
- Summarise and explore Pandas DataFrames☆98Updated 5 years ago
- A simple example of a Python API for real-time machine learning, using scikit-learn, Flask and Docker☆136Updated last year
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- 🍦 Deployment tool for online machine learning models☆97Updated 3 years ago
- Python automatic data quality check toolkit☆283Updated 4 years ago
- Machine Flow enables visual execution and tracking of machine learning workflows. Users dynamically create dependency graphs, with each n…☆62Updated 6 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 5 years ago
- Fuzzy joins for Python pandas - easily join different datasets☆59Updated 4 years ago
- A frictionless integrated platform for notebook☆83Updated 2 years ago
- ☆96Updated 5 years ago