Schemas for Mozilla's data ingestion pipeline and data lake outputs
☆50 · Mar 17, 2026 · Updated this week
Alternatives and similar repositories for mozilla-pipeline-schemas
Users who are interested in mozilla-pipeline-schemas often compare it to the repositories listed below.
- Mozilla Services Data Pipeline ☆30 · Mar 28, 2019 · Updated 6 years ago
- Telemetry onboarding material ☆11 · Apr 1, 2020 · Updated 5 years ago
- ☆28 · Jul 31, 2021 · Updated 4 years ago
- Bigquery ETL ☆331 · Updated this week
- The project behind https://download.mozilla.org/ 🔥 ☆14 · Mar 9, 2026 · Updated last week
- Steve Fink's random development tools ☆17 · Jan 14, 2026 · Updated 2 months ago
- Spark Streaming ETL jobs for Mozilla Telemetry ☆18 · Dec 5, 2019 · Updated 6 years ago
- A Primer for Web Performance Timing APIs ☆22 · Jul 21, 2020 · Updated 5 years ago
- agogosml is a flexible data processing pipeline that addresses the common need for operationalizing ML models at scale ☆34 · May 3, 2019 · Updated 6 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 · Feb 13, 2020 · Updated 6 years ago
- ☆19 · Feb 23, 2026 · Updated 3 weeks ago
- Mirror of https://hg.mozilla.org/hgcustom/version-control-tools ☆31 · Feb 27, 2026 · Updated 3 weeks ago
- LookML Generator for Glean and Mozilla Data ☆24 · Updated this week
- Selenium WebDriver compatible page-object model and utilities for Firefox Accounts ☆11 · Sep 18, 2019 · Updated 6 years ago
- Collection of dockerized ETL jobs managed by data engineering. ☆21 · Mar 12, 2026 · Updated last week
- Apache Airflow CI pipeline ☆19 · Jun 12, 2019 · Updated 6 years ago
- Hindsight Administration User Interface ☆11 · Jun 4, 2022 · Updated 3 years ago
- Web index for Mozilla data tools and docs ☆95 · Mar 11, 2026 · Updated last week
- ETL jobs for Firefox Telemetry ☆29 · Nov 7, 2025 · Updated 4 months ago
- Grouper Python Client Library ☆10 · Apr 18, 2023 · Updated 2 years ago
- Service to deliver sponsored content while preserving privacy. Owned by the Ads team. Deployed in GCP. ☆18 · May 21, 2024 · Updated last year
- This program post-processes the stack frames produced by `MozFormatCodeAddress()`. ☆22 · Apr 4, 2023 · Updated 2 years ago
- This repository contains implementation to process private data shares collected according to the Exposure Notification Private Analytics… ☆12 · Sep 19, 2024 · Updated last year
- Solve problems of device identity, certificates and the keychain. ☆13 · Jan 3, 2019 · Updated 7 years ago
- Log analysis pipeline utilizing Apache Beam ☆25 · Jul 5, 2023 · Updated 2 years ago
- OWASP Zed Attack Proxy plugin for py.test ☆13 · Sep 10, 2015 · Updated 10 years ago
- A Scalable Data Cleaning Library for PySpark. ☆29 · Apr 4, 2019 · Updated 6 years ago
- DEPRECATED: Mozilla Build Metadata Service ☆13 · Jun 27, 2019 · Updated 6 years ago
- From the medium article about Customer Retention ☆11 · Nov 20, 2019 · Updated 6 years ago
- A small mocking library for Rust ☆14 · Aug 26, 2018 · Updated 7 years ago
- A configuration tool for virtual domain email for Postfix and Dovecot ☆16 · May 15, 2016 · Updated 9 years ago
- A collection of tools that help me work with Avro ☆24 · Jan 7, 2010 · Updated 16 years ago
- A simple browser for Git repositories - (mirror of https://gerrit.googlesource.com/gitiles) ☆23 · Updated this week
- Webfinger client library for Node.js ☆40 · Oct 30, 2018 · Updated 7 years ago
- Firefox's performance dashboard ☆71 · Mar 12, 2026 · Updated last week
- Multi-stage, config driven, SQL based ETL framework using PySpark ☆26 · Sep 16, 2019 · Updated 6 years ago
- DEPRECATED ☆12 · Oct 18, 2021 · Updated 4 years ago
- distributed transaction processor ☆16 · Updated this week
- Automated scripts for installing dedicated wptagent instances ☆13 · Jan 17, 2025 · Updated last year