treasure-data / luigi-td-exampleLinks
Example Repository for Building Complex Data Pipeline with Luigi +TD
☆24Updated 10 years ago
Alternatives and similar repositories for luigi-td-example
Users that are interested in luigi-td-example are comparing it to the libraries listed below
Sorting:
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- ☆54Updated 8 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- A luigi powered analytics / warehouse stack☆88Updated 8 years ago
- Example for an airflow plugin☆49Updated 9 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Required packages for using pandas in AWS Lambda functions☆45Updated 9 years ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆45Updated last month
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about …☆45Updated 6 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆150Updated 8 years ago
- An example mini data warehouse for python project stats, template for new projects☆178Updated 5 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- The Scalding tutorial as a standalone SBT project☆51Updated 7 years ago
- Airflow configuration for Telemetry☆195Updated last week
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- Serializes data into a JSON format using AVRO schema.☆138Updated 3 years ago
- DataPipeline for humans.☆249Updated 3 years ago
- Luigi Workflow Engine integration for Treasure Data☆16Updated 7 years ago
- Luigi Plugin for Hubot☆36Updated 9 years ago
- An extendable Docker image for Airbnb's Superset platform, previously known as Caravel.☆114Updated 3 years ago
- PyAthenaJDBC is an Amazon Athena JDBC driver wrapper for the Python DB API 2.0 (PEP 249).☆95Updated 2 years ago
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst…☆36Updated 5 years ago
- A tool for moving tables from Redshift to BigQuery☆65Updated 6 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 5 months ago
- SQS-based Python SDK for streaming data in realtime to the Panoply platform☆17Updated 3 months ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma…