BBVA / data-refinery
Data transformation
☆23Updated 3 years ago
Alternatives and similar repositories for data-refinery:
Users that are interested in data-refinery are comparing it to the libraries listed below
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- A tool for anomaly detection over streaming data based on sentiment analysis☆30Updated 6 years ago
- ☆30Updated 3 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- ElasticSearch implementation of MlFlow tracking store☆18Updated 4 years ago
- ☆29Updated last year
- Techniques for Scraping the Web in Python☆26Updated 6 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- ☆34Updated 3 months ago
- curated list of awesome tools and libraries for specific domains☆42Updated this week
- real-time data + ML pipeline☆54Updated this week
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆12Updated last year
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- This project trains a Machine Learning model to predict house prices and then exposes Jupyter notebook cells as REST Endpoints to make pr…☆12Updated 6 years ago
- KnowledgeRepo + JupyterLab☆48Updated 4 months ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- Machine Learning Projects with Flytekit☆36Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Utilities for creating ETL pipelines with mara☆37Updated 2 years ago
- Cloud Pipelines Editor is a web app that allows the users to build and run Machine Learning pipelines without having to set up developmen…☆56Updated 2 years ago
- ☆19Updated 4 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆34Updated last week
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster☆39Updated 4 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 5 years ago
- Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML☆41Updated 3 months ago
- Workshop about DVC VSCode Extension☆14Updated 6 months ago