mozilla / python_mozetlLinks
ETL jobs for Firefox Telemetry
☆28Updated last month
Alternatives and similar repositories for python_mozetl
Users that are interested in python_mozetl are comparing it to the libraries listed below
Sorting:
- A library for creating full representations of Mozilla telemetry pings.☆11Updated this week
- Aggregator job for Telemetry.☆8Updated last year
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 5 years ago
- Collection of dockerized ETL jobs managed by data engineering.☆18Updated this week
- Telemetry Analysis Service☆36Updated 5 years ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs☆48Updated this week
- Airflow configuration for Telemetry☆189Updated this week
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- Repository for public analyses.☆5Updated 3 years ago
- Spark bindings for Mozilla Telemetry☆15Updated last year
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆44Updated last month
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images.☆28Updated last year
- An example PySpark project with pytest☆16Updated 7 years ago
- Ansible role to deploy and configure Airflow☆41Updated this week
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- ☆8Updated 4 years ago
- A package providing helpers for authenticating to Google APIs.☆39Updated 4 months ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- CLI for creating databases for Data Quality Dashboards.☆19Updated 5 years ago
- ⚠ This is 100% vaporware. It's something I wish existed. Maybe I'll build it some day... want to help? ⚠☆11Updated 4 years ago
- PREVIEW - SQL databases in Bonobo, using sqlalchemy☆25Updated 2 years ago
- Postgraas is a super simple PostgreSQL-as-a-service☆29Updated 5 years ago
- A toolset to streamline running spark python on EMR☆20Updated 8 years ago
- A collection of datasets and databases☆24Updated 7 years ago
- A python client library for the Stitch Import API☆42Updated last year
- High Level Kafka Scanner☆19Updated 7 years ago