RealImpactAnalytics / trumaniaLinks
Trumania is a scenario-based random dataset generator library in python 3
☆112Updated 3 years ago
Alternatives and similar repositories for trumania
Users that are interested in trumania are comparing it to the libraries listed below
Sorting:
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 4 years ago
- Dockerfiles for images used as part of the Orbyter toolset☆44Updated last year
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 6 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 3 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated last month
- A simple Spark TDD example☆26Updated 7 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆109Updated last week
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆99Updated this week
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 7 years ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- python automatic data quality check toolkit☆283Updated 4 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆121Updated last month
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆117Updated last year
- ☆26Updated 4 years ago
- pytest plugin to run the tests with support of pyspark☆86Updated 2 weeks ago
- Machine Flow enables visual execution and tracking of machine learning workflows. Users dynamically create dependency graphs, with each n…☆62Updated 6 years ago
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago