mozilla-services / mozilla-pipeline-schemas
Schemas for Mozilla's data ingestion pipeline and data lake outputs
☆47Updated this week
Alternatives and similar repositories for mozilla-pipeline-schemas:
Users that are interested in mozilla-pipeline-schemas are comparing it to the libraries listed below
- ETL jobs for Firefox Telemetry☆27Updated 6 months ago
- Aggregator job for Telemetry.☆8Updated last year
- A guide for Mozilla's developers and data scientists to analyze and interpret the data gathered by our data collection systems.☆86Updated 3 weeks ago
- Spark bindings for Mozilla Telemetry☆15Updated last year
- Documentation and implementation of telemetry ingestion on Google Cloud Platform☆82Updated this week
- Scrape and publish Telemetry probe data from Firefox☆24Updated this week
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 5 years ago
- Telemetry Analysis Service☆36Updated 5 years ago
- Mozilla Services Data Pipeline☆30Updated 5 years ago
- Repository for public analyses.☆5Updated 3 years ago
- A library for creating full representations of Mozilla telemetry pings.☆11Updated last month
- Home of Mozilla IAM change integration service repository.☆10Updated 3 months ago
- Library to access and aggregate several Mozilla data sources.☆10Updated 4 months ago
- Internal tool to manage release builds☆12Updated 5 years ago
- Airflow configuration for Telemetry☆185Updated this week
- LookML Generator for Glean and Mozilla Data☆20Updated this week
- ☆21Updated 2 months ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Open source tools for Google Cloud Storage and Databases.☆63Updated 10 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-connection☆29Updated last year
- Telemetry onboarding material☆11Updated 4 years ago
- Web server that receives gzip'd POST requests and saves them uncompressed locally☆10Updated 2 years ago
- ☆28Updated 3 years ago
- Telemetry-Aware Addon Recommender☆30Updated last year
- BigQuery import and processing pipelines☆67Updated 2 weeks ago
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆34Updated 2 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- Taskcluster team planning☆13Updated 4 months ago
- Database plugins☆14Updated this week
- Knowledge management where the data scientist is in control☆12Updated 4 years ago