BBVA / data-refinery
Data transformation
☆23Updated 3 years ago
Alternatives and similar repositories for data-refinery:
Users that are interested in data-refinery are comparing it to the libraries listed below
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- Time series based anomaly detector☆82Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- ☆30Updated last year
- ☆34Updated last month
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- A repository of examples for Elyra (https://github.com/elyra-ai/elyra)☆81Updated 2 weeks ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 5 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Model drift detection☆11Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- Big Data Demystified meetup and blog examples☆31Updated 5 months ago
- MLflow and Prefect with docker-compose☆16Updated 2 years ago
- MLOps simplified. One platform, all the functionality you need. Swiss made☆97Updated last month
- ElasticSearch implementation of MlFlow tracking store☆18Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆32Updated last year
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest…☆58Updated 3 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated last year
- ☆19Updated 3 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆34Updated this week
- Awesome List for Data Operations☆24Updated 4 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- ☆37Updated 5 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆64Updated 11 months ago