zendesk / pakkrLinks
Python pipeline utility library
☆18Updated 2 years ago
Alternatives and similar repositories for pakkr
Users that are interested in pakkr are comparing it to the libraries listed below
Sorting:
- CLI tool to launch Spark jobs on AWS EMR☆67Updated 2 years ago
- Scala Aggregators used for ML Model metrics monitoring☆91Updated 2 years ago
- ☆46Updated last year
- ☆54Updated 8 years ago
- A Giter8 template for scio☆31Updated last week
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Updated 2 years ago
- A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)☆72Updated 5 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆134Updated 3 years ago
- Declarative interface for building images and running commands in containers using Docker.☆36Updated 2 years ago
- A lightweight library intended to make developing Google DataFlow jobs in Scala easier.☆14Updated 5 years ago
- Bazel rules for pip requirements☆31Updated 4 years ago
- ☆95Updated 2 years ago
- Puppet module to provision Airbnb's Airflow☆20Updated 3 years ago
- Tool for exploring data on an Apache Kafka cluster☆42Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆268Updated 2 years ago
- ☆14Updated 3 years ago
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- A curated list of awesome resources for Apache Beam☆145Updated 3 years ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆256Updated 4 months ago
- spark-emr☆15Updated 11 years ago
- A CLI and library to run Singer Taps and Targets☆35Updated 3 years ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆194Updated last month
- Streaming left joins in Kafka for change data capture☆52Updated last year
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- Building Scio from scratch step by step☆20Updated 6 years ago
- Bazel rules for compiling Play Framework routes files☆11Updated 6 months ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 7 years ago
- GCS support for avro-tools, parquet-tools and protobuf☆77Updated 6 months ago
- A tool for data sampling, data generation, and data diffing☆345Updated 7 months ago