A toolset to streamline running Spark Python on EMR
☆20 · Nov 16, 2016 · Updated 9 years ago
Alternatives and similar repositories for pyspark-emr
Users that are interested in pyspark-emr are comparing it to the libraries listed below.
- Quickstart PySpark with Anaconda on AWS/EMR ☆52 · Jan 9, 2017 · Updated 9 years ago
- ☆12 · Jun 3, 2016 · Updated 9 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Oct 18, 2023 · Updated 2 years ago
- Ambari stack service for easily installing and managing NTPD on HDP cluster ☆14 · Apr 3, 2018 · Updated 8 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka ☆28 · Apr 23, 2026 · Updated 2 weeks ago
- Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.… ☆11 · Jul 29, 2017 · Updated 8 years ago
- A Spark datasource for the HadoopOffice library ☆36 · Sep 29, 2025 · Updated 7 months ago
- ☆12 · Oct 16, 2023 · Updated 2 years ago
- Terraform Module to create an Apache Zookeeper cluster on AWS ☆13 · Jan 3, 2022 · Updated 4 years ago
- Ambari service for RedHat FreeIPA ☆11 · Sep 30, 2016 · Updated 9 years ago
- Python script to repair the primary range of a node in N discrete steps ☆12 · Aug 3, 2018 · Updated 7 years ago
- Basic Spark utilities ☆13 · Feb 20, 2025 · Updated last year
- Grafana Prometheus exporter ☆10 · Oct 17, 2017 · Updated 8 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code ☆14 · Nov 29, 2021 · Updated 4 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w… ☆16 · Oct 3, 2025 · Updated 7 months ago
- Docker compose files for various Kafka stacks ☆32 · Feb 24, 2018 · Updated 8 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 · Jan 11, 2017 · Updated 9 years ago
- API REST boilerplate using Spring Boot and Redis as database ☆13 · Dec 26, 2018 · Updated 7 years ago
- Packer Template to build an AWS Apache Cassandra AMI ☆10 · Jan 3, 2022 · Updated 4 years ago
- Sample Docker Compose files for running Apache Ambari ☆11 · Oct 29, 2018 · Updated 7 years ago
- ☆12 · Apr 27, 2018 · Updated 8 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2 ☆12 · Oct 22, 2021 · Updated 4 years ago
- ☆10 · Jan 31, 2016 · Updated 10 years ago
- Jiraya - Simple Jira CLI ☆17 · Dec 13, 2019 · Updated 6 years ago
- ☆10 · Mar 31, 2021 · Updated 5 years ago
- Custom Alerts for Ambari server ☆12 · Jul 27, 2015 · Updated 10 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆48 · Jan 7, 2025 · Updated last year
- BlockChain DApp using Angular ☆10 · Sep 24, 2018 · Updated 7 years ago
- Hi Spring fans! Welcome to a quick, mid-interregnum installment of Spring Tips in which we look at a few features that let you be both la… ☆13 · Mar 14, 2019 · Updated 7 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers. ☆19 · Apr 23, 2025 · Updated last year
- Rocksdb state storage implementation for Structured Streaming. ☆17 · Oct 21, 2020 · Updated 5 years ago
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi ☆15 · Jan 2, 2023 · Updated 3 years ago
- This is a fork of the Apache Flink Kinesis connector adding Enhanced Fanout support for Flink 1.8/1.11 on KDA. ☆24 · Mar 1, 2026 · Updated 2 months ago
- Tool to migrate Prometheus 1.x data directories to the 2.0 format. ☆14 · Jan 18, 2018 · Updated 8 years ago
- ☆14 · Sep 18, 2016 · Updated 9 years ago
- Prototype Pandemic Unemployment Assistance (PUA) claim service ☆12 · Dec 2, 2021 · Updated 4 years ago
- A repository with different graph processing technologies ☆11 · Nov 30, 2015 · Updated 10 years ago
- https://www.packtpub.com/books/info/authors/tomasz-lelek ☆13 · Oct 30, 2021 · Updated 4 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR ☆23 · Mar 3, 2020 · Updated 6 years ago