chen1649chenli / dataOpsResourceLinks
Awesome List for Data Operations
☆24Updated 5 years ago
Alternatives and similar repositories for dataOpsResource
Users that are interested in dataOpsResource are comparing it to the libraries listed below
Sorting:
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆88Updated 2 years ago
- A curated list of awesome DataOps tools☆202Updated 2 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- Code examples showing flow deployment to various types of infrastructure☆109Updated 2 years ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- ODD Specification is a universal open standard for collecting metadata.☆144Updated 11 months ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆103Updated 3 weeks ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆23Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Pandas helper functions☆31Updated 2 years ago
- Making DAG construction easier☆272Updated 3 weeks ago
- New generation opensource data stack☆73Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Demo on how to use Prefect 2 in an ML project☆41Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆58Updated 3 years ago
- A frictionless integrated platform for notebook☆83Updated 2 years ago
- Templates for your Kedro projects.☆79Updated 3 weeks ago
- A curated list of dagster code snippets for data engineers☆57Updated last year
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- Apache Spark Guide☆34Updated 3 years ago
- Sample projects using Ploomber.☆86Updated last year
- A downloadable pdf containing summary of frequently used pandas operations.☆10Updated 5 years ago
- A collection of python utility functions☆11Updated last year