mozilla/emr-bootstrap-spark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mozilla/emr-bootstrap-spark)

mozilla / emr-bootstrap-spark

AWS bootstrap scripts for Mozilla's flavoured Spark setup.

☆47

Alternatives and similar repositories for emr-bootstrap-spark

Users that are interested in emr-bootstrap-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mozilla / telemetry-streaming
View on GitHub
Spark Streaming ETL jobs for Mozilla Telemetry
☆18Dec 5, 2019Updated 6 years ago
looker / lookml-test-runner
View on GitHub
An experimental test runner for LookML models.
☆11Sep 17, 2021Updated 4 years ago
mozilla / python_moztelemetry
View on GitHub
Spark bindings for Mozilla Telemetry
☆15Jan 22, 2026Updated 6 months ago
mozilla / python_mozetl
View on GitHub
ETL jobs for Firefox Telemetry
☆29May 7, 2026Updated 2 months ago
hammerlab / spark-tests
View on GitHub
Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mozilla / telemetry-airflow
View on GitHub
Airflow configuration for Telemetry
☆205Jul 17, 2026Updated last week
mozilla / telemetry-batch-view
View on GitHub
A Scala framework to build derived datasets, aka batch views, of Telemetry data.
☆35Jun 24, 2022Updated 4 years ago
mozilla / telemetry-analysis-service
View on GitHub
Telemetry Analysis Service
☆38Dec 4, 2019Updated 6 years ago
bartTC / django-comments-spamfighter
View on GitHub
Not in active development; see README -- A Django app that contributes Akismet and Keyword blocking to your django comments.
☆18Mar 19, 2013Updated 13 years ago
praekelt / django-generate
View on GitHub
Django slightly smarter than fixtures content generation app.
☆19Aug 27, 2015Updated 10 years ago
clangupc / clang-upc
View on GitHub
Clang UPC Front-End
☆17Jan 24, 2022Updated 4 years ago
David-Levinthal / machine-learning
View on GitHub
repository for notes and data from machine learning studies
☆13Dec 16, 2019Updated 6 years ago
mozilla / videur
View on GitHub
Deprecated: Lua scripts for Nginx
☆40Aug 17, 2020Updated 5 years ago
flatironinstitute / flathub
View on GitHub
A simple elasticsearch frontend for serving astrophysical simulation catalog data
☆11Mar 14, 2026Updated 4 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wirelessr / flink-iceberg-playground
View on GitHub
minio as local storage and DynamoDB as catalog
☆15May 14, 2024Updated 2 years ago
srout60 / justmeandopensource
View on GitHub
☆12Sep 25, 2019Updated 6 years ago
andrewvy / ansible-elixir
View on GitHub
Simple role for deploying Elixir Exrm releases.
☆10Jan 28, 2016Updated 10 years ago
nathanhumbert / doublecheck
View on GitHub
Test pages listed in a sitemap.
☆10Jan 7, 2015Updated 11 years ago
jfcloutier / robotex
View on GitHub
Adventures in robotics with Mindstorm EV3 and Elixir
☆12Dec 30, 2019Updated 6 years ago
vvaks0 / AvroSchemaShredder
View on GitHub
Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…
☆13Jan 11, 2017Updated 9 years ago
mediapop / datetime_truncate
View on GitHub
Truncate datetime objects to the specifiec level of precision, inspired by PostgreSQL's DATE_TRUNC.
☆15Apr 20, 2021Updated 5 years ago
mslinn / sbtTemplate
View on GitHub
SBT template for projects written in Scala and other JVM languages
☆13Dec 29, 2021Updated 4 years ago
alexzeitgeist / docker-bcompare
View on GitHub
Dockerized Beyond Compare
☆10Mar 28, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sul-dlss-deprecated / sparqlight
View on GitHub
[DEPRECATED] A blacklight application using SPARQL to replace Solr
☆12Mar 27, 2020Updated 6 years ago
ckan / ckanext-report
View on GitHub
CKAN report infrastructure
☆18Mar 2, 2026Updated 4 months ago
ottomata / kafka-connect-jsonschema
View on GitHub
Kafka Connect Converter using JSONSchema
☆14Oct 5, 2022Updated 3 years ago
zuazo / postfixadmin-cookbook
View on GitHub
Chef cookbook to install and configure PostfixAdmin.
☆13Mar 17, 2018Updated 8 years ago
mozilla / puente
View on GitHub
UNMAINTAINED: Django/Jinja2 l10n extract/merge commands and things (Tower replacement)
☆14May 11, 2022Updated 4 years ago
bernhard-42 / pyspark-atlas
View on GitHub
PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
☆17Jan 12, 2017Updated 9 years ago
mistercrunch / EToiLe
View on GitHub
a declarative ETL framework that enforces data engineer best practices
☆40Aug 31, 2017Updated 8 years ago
allenai / pipeline
View on GitHub
Library for building reproducible data pipelines to support experimentation
☆20Dec 16, 2015Updated 10 years ago
ANXS / nodejs
View on GitHub
Ansible role for nodejs
☆22Jun 17, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
chriswayg / toolbox
View on GitHub
A small Alpine Linux based toolbox for Docker on CoreOS with various admin tools
☆12Apr 28, 2020Updated 6 years ago
andrewstuart / kube-gen-certs
View on GitHub
Generate kubernetes ingress TLS certificates automatically via Vault
☆11Mar 28, 2018Updated 8 years ago
cvitter / Easy-Time-Series-Analysis-with-Riak-TS
View on GitHub
Code and from the Easy Time Series Analysis with Riak TS, Python, Pandas & Jupyter meetup
☆11May 18, 2016Updated 10 years ago
realies / audiowaveform-docker
View on GitHub
⛴ Audiowaveform Docker Container
☆13May 23, 2026Updated 2 months ago
joshbuddy / rack-cache-while-revalidate
View on GitHub
Works with Rack::Cache to serve up stale data while silently revalidating
☆18Dec 3, 2009Updated 16 years ago
aws-samples / aws-glue-data-catalog-replication-utility
View on GitHub
Replication utility for AWS Glue Data Catalog
☆80Aug 8, 2024Updated last year
NuCivic / react-dash-boilerplate
View on GitHub
Quick-start for react-dash projects
☆15Aug 18, 2017Updated 8 years ago