BBVA / data-refinery
Data transformation
☆23Updated 4 years ago
Alternatives and similar repositories for data-refinery
Users that are interested in data-refinery are comparing it to the libraries listed below
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆106Updated last week
- Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML☆41Updated last year
- Automated Data Science and Machine Learning library to optimize workflow.☆105Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- ☆36Updated 9 months ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 3 years ago
- real-time data + ML pipeline☆53Updated last week
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 6 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Updated 6 years ago
- Set of IPython and Jupyter extensions to improve user experience☆50Updated 6 years ago
- Instant search for and access to many datasets in PySpark.☆34Updated 3 years ago
- Large-scale Graph Mining with Spark☆39Updated 7 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 5 years ago
- Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning.☆65Updated 5 years ago
- MLflow App Library☆77Updated 7 years ago
- KnowledgeRepo + JupyterLab☆48Updated 2 weeks ago
- Primrose modeling framework for simple production models☆33Updated last year
- A series of workshop modules introducing Feast feature store.☆19Updated 3 years ago
- Python implementations of record linkage blocking techniques.☆21Updated 2 years ago
- pycaret-demo-mlflow☆30Updated 4 years ago
- Tutorials for YData's Fabric platform☆35Updated 8 months ago
- Operations Research Algorithms☆19Updated last year
- plait.py - a fake data modeler☆436Updated 7 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆52Updated 4 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- Time series based anomaly detector☆82Updated 4 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- Streamlit example showing Scikit Learn & PySpark ML over healthcare data! It's simple!☆31Updated 5 years ago
- Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark☆74Updated 2 years ago