mozilla / python_mozetl
ETL jobs for Firefox Telemetry
☆27Updated 6 months ago
Alternatives and similar repositories for python_mozetl:
Users that are interested in python_mozetl are comparing it to the libraries listed below
- Aggregator job for Telemetry.☆8Updated last year
- Telemetry Analysis Service☆36Updated 5 years ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs☆47Updated this week
- A library for creating full representations of Mozilla telemetry pings.☆11Updated last month
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 5 years ago
- Spark bindings for Mozilla Telemetry☆15Updated last year
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Collection of dockerized ETL jobs managed by data engineering.☆19Updated 3 weeks ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆44Updated last month
- ☆8Updated 4 years ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data☆29Updated 7 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- A python client library for the Stitch Import API☆42Updated last year
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- Infrastructure for making a pandas release☆7Updated 2 years ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- Airflow configuration for Telemetry☆185Updated this week
- An example PySpark project with pytest☆17Updated 7 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Apache Airflow CI pipeline☆19Updated 5 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Home of Mozilla IAM change integration service repository.☆10Updated 3 months ago
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆34Updated 2 years ago
- An example of how to run a Python project w/ pipenv in a Buildkite pipeline☆15Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python☆13Updated 2 years ago