RealImpactAnalytics / trumania
Trumania is a scenario-based random dataset generator library in python 3
☆110Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for trumania
- A small Python module containing quick utility functions for standard ETL processes.☆33Updated last week
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated last year
- Dockerfiles for images used as part of the Orbyter toolset☆44Updated 5 months ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆104Updated last week
- ☆109Updated last year
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated last year
- 🍦 Deployment tool for online machine learning models☆97Updated 2 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆140Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 4 years ago
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆84Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 2 years ago
- Primrose modeling framework for simple production models☆34Updated 7 months ago
- ☆33Updated 5 years ago
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated last year
- Build your feature store with macros right within your dbt repository☆37Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement.☆136Updated 9 months ago
- PySpark phonetic and string matching algorithms☆35Updated 8 months ago
- REST-like API exposing Airflow data and operations☆61Updated 5 years ago
- Machine Flow enables visual execution and tracking of machine learning workflows. Users dynamically create dependency graphs, with each n…☆63Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 4 years ago
- A web frontend for scheduling Jupyter notebook reports☆251Updated 2 years ago
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆78Updated 2 years ago
- An Ubuntu Vagrant Virtual Machine (VM) with Airflow, a data workflow management system from Airbnb☆9Updated 4 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Updated 3 years ago