zendesk / pakkr
Python pipeline utility library
☆18 Updated 2 years ago
Alternatives and similar repositories for pakkr
Users that are interested in pakkr are comparing it to the libraries listed below
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated 2 years ago
- ☆14 Updated 3 years ago
- Bazel rules for pip requirements ☆31 Updated 5 years ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes. ☆269 Updated 2 years ago
- Rules for generating native Bazel Python libraries from requirements.txt ☆15 Updated 5 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆259 Updated 2 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co… ☆71 Updated 6 years ago
- ☆54 Updated 8 years ago
- A curated list of awesome resources for Apache Beam ☆145 Updated 3 years ago
- pazel - generate Bazel BUILD files for Python ☆41 Updated 2 years ago
- Metadata service library for Amundsen ☆82 Updated 2 weeks ago
- Bazel rules to resolve and fetch artifacts transitively from the Python Package Index (PyPI) ☆60 Updated 5 years ago
- ☆24 Updated 5 years ago
- Airflow declarative DAGs via YAML ☆133 Updated 2 years ago
- Airflow configuration for Telemetry ☆199 Updated this week
- Pylint plugin for static code analysis on Airflow code ☆97 Updated 5 years ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam ☆193 Updated 3 months ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆134 Updated 3 years ago
- Puppet module to provision Airbnb's Airflow ☆20 Updated 3 years ago
- Observability for your AWS load balancers, CloudFront, and more ☆51 Updated last year
- Automatically source and unsource a project's environment ☆151 Updated last year
- ☆95 Updated 2 years ago
- Parquet Command-line Tools ☆19 Updated 9 years ago
- Search service library for Amundsen ☆54 Updated 2 weeks ago
- Data ingestion library for Amundsen to build graph and search index ☆204 Updated last year
- A lightweight library intended to make developing Google DataFlow jobs in Scala easier. ☆14 Updated 5 years ago
- Building and packaging Python with Pants and PEX - an annotated example ☆33 Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆204 Updated last month
- A Scala framework to build derived datasets, aka batch views, of Telemetry data. ☆35 Updated 3 years ago
- Bazel build rules for machine learning workflows ☆33 Updated 3 years ago