markfink / jupyter-athena-sql
run SQL queries on AWS Athena from jupyter notebooks
☆19Updated 5 years ago
Related projects: ⓘ
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- Dask on ECS Fargate☆14Updated 4 years ago
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆30Updated 4 years ago
- ☆10Updated this week
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆19Updated 3 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- Airflow workflow management platform chef cookbook.☆67Updated 5 years ago
- A collection of airflow sample workflows for data processing on aws☆12Updated 6 years ago
- An opinionated template for spinning up a dask cluster based on docker.☆13Updated 6 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆22Updated last year
- Example Repository for Building Complex Data Pipeline with Luigi +TD☆24Updated 9 years ago
- Airflow code accompanying blog post.☆21Updated 5 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Infrastructure code to run notebooks on some EC2 nodes☆10Updated 6 years ago
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images.☆28Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Cloudformation templates for deploying Airflow in ECS☆40Updated 5 years ago
- ☆11Updated this week
- ☆20Updated this week
- A python client library for the Stitch Import API☆42Updated 8 months ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 weeks ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated 9 months ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Build Lambda deployment packages faster with Docker☆24Updated 7 months ago
- A toolset to streamline running spark python on EMR☆20Updated 7 years ago
- REST-like API exposing Airflow data and operations☆61Updated 5 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift☆19Updated last year
- ☆16Updated this week