chen1649chenli / dataOpsResource
Awesome List for Data Operations
☆24Updated 4 years ago
Alternatives and similar repositories for dataOpsResource:
Users that are interested in dataOpsResource are comparing it to the libraries listed below
- Full stack data engineering tools and infrastructure set-up☆47Updated 3 years ago
- Awesome list of dataops products, open source and resources☆24Updated 2 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- A curated list of awesome DataOps tools☆169Updated 3 months ago
- Notebooks for the ML Link Prediction Course☆14Updated 4 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- A tool to automatically infer columns data types in .csv files☆35Updated last year
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆21Updated 2 years ago
- Best practices for engineering ML pipelines.☆37Updated 2 years ago
- ☆30Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- This repo is an approach to TDD in machine learning model operation. it covers project structure, testing essentials using pytest with Gi…☆15Updated 4 years ago
- real-time data + ML pipeline☆54Updated this week
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆70Updated last year
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆82Updated this week
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆78Updated 4 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆74Updated last year
- Demo on how to use Prefect 2 in an ML project☆40Updated 2 years ago
- Awesome list for datapipeline☆31Updated last year
- Content for a talk on "The wonderful world of data quality tools in Python"☆19Updated 3 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 6 months ago
- Big Data Demystified meetup and blog examples☆31Updated 5 months ago
- CLI for data platform☆19Updated last year