mozilla / telemetry-streaming
Spark Streaming ETL jobs for Mozilla Telemetry
☆18Updated 5 years ago
Alternatives and similar repositories for telemetry-streaming:
Users who are interested in telemetry-streaming are comparing it to the libraries listed below.
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆34Updated 2 years ago
- Aggregator job for Telemetry.☆8Updated last year
- ETL jobs for Firefox Telemetry☆28Updated 5 months ago
- Spark bindings for Mozilla Telemetry☆14Updated last year
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Apache Amaterasu☆56Updated 5 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Automatic alert system for telemetry histograms☆8Updated 4 years ago
- Mozilla Services Data Pipeline☆30Updated 5 years ago
- Data-ish exploration through SQL+Uncertainty☆27Updated 2 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Sample custom NiFi processor to process tcpdump☆18Updated 9 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 2 weeks ago
- Repository for public analyses.☆5Updated 3 years ago
- An open source enterprise data warehousing and analysis platform.☆21Updated 3 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆14Updated 8 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- Apache Beam Site☆29Updated this week
- Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka☆12Updated last year
- A collection of datasets and databases☆24Updated 6 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- Library for per-file client-side encryption in Hadoop FileSystems such as HDFS or S3.☆45Updated last week
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆111Updated 4 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- ☆48Updated 5 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 7 years ago