BBVA / data-refinery
Data transformation
☆23 Updated 4 years ago
Alternatives and similar repositories for data-refinery
Users who are interested in data-refinery compare it to the libraries listed below.
- KnowledgeRepo + JupyterLab ☆48 Updated 8 months ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 Updated 4 years ago
- MLflow App Library ☆80 Updated 6 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need. ☆99 Updated last week
- Tutorials for YData's Fabric platform ☆33 Updated 2 months ago
- Deployment tools/scripts for Metaflow! ☆56 Updated 2 years ago
- 📚 Jupyter Notebooks extension for versioning, managing and sharing notebook checkpoints in your machine learning and data science projec… ☆34 Updated last year
- Techniques for Scraping the Web in Python ☆25 Updated 7 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag ☆23 Updated 2 years ago
- Sample Notebooks for PipelineAI ☆44 Updated 2 years ago
- bamboolib - template for creating your own binder notebook ☆21 Updated 3 years ago
- Trumania is a scenario-based random dataset generator library in python 3 ☆112 Updated 3 years ago
- Instant search for and access to many datasets in Pyspark. ☆34 Updated 2 years ago
- PySpark, Databrick, h2o, MLlib ☆19 Updated 8 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆36 Updated last week
- Projects developed by Domino's R&D team ☆78 Updated 3 years ago
- 📝 A blog post about report generation and automation in python ☆40 Updated 5 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 5 years ago
- An HTTP endpoint to create on-demand dynamic HTML reports powered by Jupyter Notebooks ☆19 Updated 3 years ago
- ☆19 Updated 4 years ago
- Record matching and entity resolution at scale in Spark ☆34 Updated last year
- How to do data science with Optimus, Spark and Python. ☆19 Updated 6 years ago
- Deployment template for a continuous training pipeline. ☆21 Updated 3 years ago
- ☆59 Updated 3 years ago
- H2OAI Driverless AI Code Samples and Tutorials ☆37 Updated 8 months ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser ☆33 Updated 2 years ago
- Operations Research Algorithms ☆17 Updated last year
- Simple template showing how to set up docker for reproducible data science with Jupyter notebooks. ☆23 Updated last year
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 Updated 4 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera ☆10 Updated 5 years ago