RealImpactAnalytics / trumania

Trumania is a scenario-based random dataset generator library for Python 3.

☆110

Alternatives and similar repositories for trumania

Users who are interested in trumania compare it to the libraries listed below.

tomaszdudek7 / airflow_project
scaffold of Apache Airflow executing Docker containers
☆85 · Updated 3 years ago
manifoldai / orbyter-docker
Dockerfiles for images used as part of the Orbyter toolset
☆44 · Updated last year
Bergvca / pyspark_dist_explore
Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.
☆102 · Updated 6 years ago
d6t / d6tstack
Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet
☆196 · Updated 2 years ago
Ashton-Sidhu / aethos
Automated Data Science and Machine Learning library to optimize workflow.
☆105 · Updated 2 years ago
alteryx / autonormalize
python library for automated dataset normalization
☆117 · Updated 2 years ago
vaexio / dash-120million-taxi-app
Explore 120 million taxi trips in real time with Dash and Vaex
☆117 · Updated 5 years ago
MrPowers / ceja
PySpark phonetic and string matching algorithms
☆41 · Updated last year
sosuneko / pydqc
python automatic data quality check toolkit
☆278 · Updated 5 years ago
drivendataorg / nbautoexport
Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.
☆84 · Updated 4 months ago
Soluto / python-flask-sklearn-docker-template
A simple example of python api for real time machine learning, using scikit-learn, Flask and Docker
☆136 · Updated 2 years ago
jgoerner / data-science-stack-cookiecutter
🐳📊🤓 Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, AirFlow & …
☆215 · Updated 2 years ago
datacamp / viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
☆127 · Updated 4 years ago
mara / mara-etl-tools
Utilities for creating ETL pipelines with mara
☆36 · Updated 3 years ago
ing-bank / spark-matcher
Record matching and entity resolution at scale in Spark
☆36 · Updated 2 years ago
darenasc / auto-eda
Automated Exploratory Data Analysis. Simplifying Data Exploration
☆36 · Updated 5 years ago
johnmuller87 / spark-udf
☆34 · Updated 6 years ago
michaelchanwahyan / datalab
☆113 · Updated last year
hi-primus / bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
☆141 · Updated 2 years ago
tkp-archive / paperboy
A web frontend for scheduling Jupyter notebook reports
☆254 · Updated last year
kenfar / DataGristle
Tough and flexible tools for data analysis, transformation, validation and movement.
☆140 · Updated 2 years ago
dvgodoy / handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
☆197 · Updated 6 years ago
BigDataRepublic / bdr-analytics-py
Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…
☆30 · Updated 7 years ago
ubisoft / mobydq
Tool to automate data quality checks on data pipelines
☆256 · Updated 3 years ago
ColtAllen / btyd
Buy Till You Die and Customer Lifetime Value statistical models in Python.
☆118 · Updated last year
d6t / d6t-python
Accelerate data science
☆118 · Updated 4 years ago
schlerp / flowpy
manipulate pandas dataframes from the comfort of your browser
☆174 · Updated 4 years ago
capitalone / locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
☆115 · Updated last week
paypal / PPExtensions
Set of iPython and Jupyter extensions to improve user experience
☆50 · Updated 6 years ago
tonyleidong / OptimalFlow
OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…
☆27 · Updated 2 years ago