zendesk / pakkr
Python pipeline utility library
☆18 Updated last year
Alternatives and similar repositories for pakkr:
Users interested in pakkr are comparing it to the libraries listed below.
- ☆14 Updated 2 years ago
- A GitHub app which runs checks for flagged terminology in GitHub repos ☆23 Updated last year
- pazel - generate Bazel BUILD files for Python ☆40 Updated last year
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated last year
- ☆54 Updated 7 years ago
- A lightweight library intended to make developing Google DataFlow jobs in Scala easier. ☆14 Updated 4 years ago
- Scala Aggregators used for ML Model metrics monitoring ☆91 Updated last year
- ☆24 Updated 5 years ago
- An HFile-backed Key-Value Server ☆43 Updated 6 years ago
- Dynamically generate Buildkite pipelines based on project changes ☆95 Updated last year
- Puppet module to provision Airbnb's Airflow ☆19 Updated 2 years ago
- Concatenate Amazon S3 files remotely using flexible patterns ☆38 Updated 4 years ago
- Compile JSON Schema into Avro and BigQuery schemas ☆44 Updated last year
- Bazel rules for compiling Play Framework routes files ☆11 Updated last week
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆261 Updated last year
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- VGS edition of Google's safe and hermetically sealed Starlark language - a non-Turing complete subset of Python 3. ☆32 Updated this week
- Functional Airflow DAG definitions. ☆38 Updated 7 years ago
- Building and packaging Python with Pants and PEX - an annotated example ☆33 Updated 2 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 Updated 6 years ago
- Bazel rules for pip requirements ☆30 Updated 4 years ago
- A Scala framework to build derived datasets, aka batch views, of Telemetry data. ☆34 Updated 2 years ago
- A declarative ETL framework that enforces data engineering best practices ☆39 Updated 7 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co… ☆71 Updated 6 years ago
- Garbage collector for Amazon ECR docker registry ☆49 Updated 3 years ago
- A global file-based mutex lock using Google Cloud Storage ☆29 Updated 6 years ago
- Run separate pipelines for each folder in your monorepo ☆203 Updated 10 months ago
- A CLI and library to run Singer Taps and Targets ☆34 Updated 3 years ago
- Metrics for Airflow ☆14 Updated last year
- A GitHub API client to extract events and actions, and load into a database ☆28 Updated 3 years ago