chen1649chenli / dataOpsResourceLinks
Awesome List for Data Operations
☆24Updated 5 years ago
Alternatives and similar repositories for dataOpsResource
Users that are interested in dataOpsResource are comparing it to the libraries listed below
Sorting:
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 2 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆103Updated this week
- A curated list of awesome DataOps tools☆207Updated 3 months ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- A collection of python utility functions☆11Updated last year
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 4 years ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆91Updated this week
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc☆100Updated 3 years ago
- Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark☆74Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆58Updated 3 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Best practices for engineering ML pipelines.☆36Updated 3 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated 2 years ago
- A frictionless integrated platform for notebook☆83Updated 2 years ago
- Template for data pipelines, ML workflows, API dev and monitoring☆44Updated last year
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- Simple samples for writing ETL transform scripts in Python☆23Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 3 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆23Updated 3 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆89Updated 10 months ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆89Updated 2 years ago
- ☆23Updated 4 years ago
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 3 years ago