chen1649chenli / dataOpsResourceLinks
Awesome List for Data Operations
☆24Updated 5 years ago
Alternatives and similar repositories for dataOpsResource
Users that are interested in dataOpsResource are comparing it to the libraries listed below
Sorting:
- Code examples showing flow deployment to various types of infrastructure☆110Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- ☆23Updated 4 years ago
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 3 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 4 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆91Updated 2 years ago
- A curated list of awesome DataOps tools☆225Updated last month
- Big Data Demystified meetup and blog examples☆31Updated last year
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆83Updated last year
- Simple samples for writing ETL transform scripts in Python☆24Updated 2 weeks ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Template for data pipelines, ML workflows, API dev and monitoring☆44Updated 2 years ago
- Automated Jupyter notebook testing. 📙☆40Updated 2 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- 🐍💨 Airflow tutorial for PyCon 2019☆88Updated 3 years ago
- New generation opensource data stack☆76Updated 3 years ago
- CLI for data platform☆20Updated 2 months ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆106Updated last week
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- ☆31Updated 2 years ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 6 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆96Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆57Updated 7 months ago
- 🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & …☆215Updated 2 years ago
- A downloadable pdf containing summary of frequently used pandas operations.☆10Updated 5 years ago
- Interactive cleaning for Pandas DataFrames☆16Updated 6 years ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆72Updated 8 months ago