mozilla / python_mozetl
ETL jobs for Firefox Telemetry
☆27 Updated last week
Alternatives and similar repositories for python_mozetl:
Users who are interested in python_mozetl are comparing it to the repositories listed below.
- Aggregator job for Telemetry. ☆8 Updated last year
- Telemetry Analysis Service ☆36 Updated 5 years ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs ☆48 Updated this week
- Spark Streaming ETL jobs for Mozilla Telemetry ☆18 Updated 5 years ago
- A library for creating full representations of Mozilla telemetry pings. ☆11 Updated last month
- Spark bindings for Mozilla Telemetry ☆15 Updated last year
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games ☆44 Updated 2 months ago
- Repository for public analyses. ☆5 Updated 3 years ago
- Collection of dockerized ETL jobs managed by data engineering. ☆20 Updated this week
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python ☆13 Updated 2 years ago
- ☆24 Updated 5 years ago
- ☆8 Updated 4 years ago
- Airflow workflow management platform chef cookbook. ☆71 Updated 5 years ago
- Documentation and implementation of telemetry ingestion on Google Cloud Platform ☆83 Updated last week
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 Updated 5 years ago
- A Python client library for the Stitch Import API ☆42 Updated last year
- A Scala framework to build derived datasets, aka batch views, of Telemetry data. ☆34 Updated 2 years ago
- Set of IPython and Jupyter extensions to improve user experience ☆50 Updated 5 years ago
- Airflow configuration for Telemetry ☆186 Updated last week
- LookML Generator for Glean and Mozilla Data ☆20 Updated this week
- A package providing helpers for authenticating to Google APIs. ☆39 Updated 3 months ago
- A declarative ETL framework that enforces data engineer best practices ☆39 Updated 7 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- An example PySpark project with pytest ☆16 Updated 7 years ago
- Ansible role to deploy and configure Airflow ☆41 Updated last month
- A collection of datasets and databases ☆24 Updated 6 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ☆18 Updated 5 years ago
- Provides a Pythonic interface for reading and writing Avro schemas ☆27 Updated 2 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 Updated 8 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆68 Updated last year