reljicd / ml-airflow
Generalized project for running Airflow DAGs, with possibility of skipping tasks already done for some set of input parameters.
☆14Updated last year
Related projects: ⓘ
- A few end to end examples that use data-describe☆16Updated last year
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Updated 4 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆17Updated 3 years ago
- ☆20Updated 2 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Code examples for the Introduction to Kubeflow course☆13Updated 3 years ago
- event-triggered plugins for airflow☆21Updated 4 years ago
- Best practices for engineering ML pipelines.☆37Updated 2 years ago
- Example project for running LensKit experiments☆13Updated 10 months ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆81Updated 5 years ago
- AWS Big Data Certification☆24Updated last year
- Follow the Lumiata Tech Blog on Medium!☆21Updated last year
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 4 years ago
- Repo for all my code on the articles I post on medium☆105Updated last year
- How to do data science with Optimus, Spark and Python.☆18Updated 5 years ago
- ☆14Updated 6 years ago
- Example to implement machine learning microservice with gRPC and Docker in Python☆81Updated 2 years ago
- Basic tutorial of using Apache Airflow☆35Updated 5 years ago
- Techniques for Scraping the Web in Python☆24Updated 6 years ago
- Record matching and entity resolution at scale in Spark☆31Updated 10 months ago
- Data mining algorithms with Python☆10Updated 5 years ago
- ☆12Updated 3 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 5 years ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆81Updated 4 months ago
- An ML project template with sensible defaults☆37Updated 2 years ago