chen1649chenli / dataOpsResourceLinks
Awesome List for Data Operations
☆24Updated 4 years ago
Alternatives and similar repositories for dataOpsResource
Users that are interested in dataOpsResource are comparing it to the libraries listed below
Sorting:
- CLI for data platform☆19Updated last year
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- real-time data + ML pipeline☆54Updated 2 weeks ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 6 years ago
- Big Data Demystified meetup and blog examples☆31Updated 11 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- A frictionless integrated platform for notebook☆83Updated 2 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆99Updated this week
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated 9 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Best practices for engineering ML pipelines.☆35Updated 3 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆84Updated 2 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆22Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 3 years ago
- ☆12Updated 3 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago
- Sample projects using Ploomber.☆86Updated last year
- Demo on how to use Prefect 2 in an ML project☆41Updated 2 years ago
- ☆29Updated last year
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Projects developed by Domino's R&D team☆78Updated 3 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆88Updated last month