mozilla / telemetry-streamingLinks
Spark Streaming ETL jobs for Mozilla Telemetry
☆18Updated 5 years ago
Alternatives and similar repositories for telemetry-streaming
Users that are interested in telemetry-streaming are comparing it to the libraries listed below
Sorting:
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆35Updated 3 years ago
- ETL jobs for Firefox Telemetry☆28Updated 2 months ago
- Schemas for Mozilla's data ingestion pipeline and data lake outputs☆48Updated this week
- Aggregator job for Telemetry.☆8Updated last year
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Spark bindings for Mozilla Telemetry☆15Updated last year
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- ☆14Updated 10 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- Elasticsearch REPL built on top of Jest☆23Updated 10 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 5 months ago
- Data-ish exploration through SQL+Uncertainty☆27Updated 2 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- Mozilla Services Data Pipeline☆30Updated 6 years ago
- Twitter Streaming API Example with Kafka Streams in Scala☆49Updated 8 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 8 years ago
- Templates for projects based on top of H2O.☆38Updated 3 months ago
- Microsoft Azure Data Lake Store Filesystem Library for Java☆21Updated 2 weeks ago
- ☆10Updated 7 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- An open source enterprise data warehousing and analysis platform.☆21Updated 3 years ago
- Taskcluster CLI☆16Updated 5 years ago
- An implementation of AsyncHBase but on top of Google's Cloud Bigtable service☆23Updated 3 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Deprecated, use https://github.com/mozilla-services/iprepd☆15Updated 7 years ago
- Code and Data for Getting Started documentation☆20Updated 2 years ago
- Sandbox for Apache nifi☆24Updated 3 years ago