RealImpactAnalytics / trumania
Trumania is a scenario-based random dataset generator library in Python 3
☆112Updated 3 years ago
Alternatives and similar repositories for trumania
Users who are interested in trumania are comparing it to the libraries listed below:
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Repo demonstrating a Dagster pipeline to generate a Neo4j graph☆21Updated 4 years ago
- Python library for automated dataset normalization☆115Updated last year
- Dockerfiles for images used as part of the Orbyter toolset☆44Updated last year
- Type System for Data Analysis in Python☆212Updated 4 months ago
- ☆111Updated 5 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆85Updated last year
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated last year
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypotheses, and visualizing results in a web browser☆33Updated 2 years ago
- Python automatic data quality check toolkit☆283Updated 4 years ago
- Predict whether or not a patient will show up to their next appointment using automated feature engineering☆28Updated 4 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- This article compares open-source Python packages for pipeline/workflow development: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX.☆57Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Dagster scikit-learn pipeline example.☆44Updated 2 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development.☆131Updated 5 years ago
- A frictionless integrated platform for notebooks☆85Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated last week
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- 🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, AirFlow & …☆213Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆29Updated 2 years ago
- Reference package for unit tests☆49Updated 6 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated this week