A toolset to streamline running spark python on EMR
☆20 · Nov 16, 2016 · Updated 9 years ago
Alternatives and similar repositories for pyspark-emr
Users that are interested in pyspark-emr are comparing it to the libraries listed below.
- Quickstart PySpark with Anaconda on AWS/EMR ☆52 · Jan 9, 2017 · Updated 9 years ago
- Introductory interactive Jupyter tutorial providing details about ORMs in order to assist in the teaching of their use to computing scien… ☆14 · Oct 21, 2025 · Updated 5 months ago
- ☆14 · Aug 10, 2021 · Updated 4 years ago
- ☆18 · Aug 28, 2024 · Updated last year
- Getting started with GESIS Notebooks ☆15 · Oct 16, 2022 · Updated 3 years ago
- ☆12 · Jun 3, 2016 · Updated 9 years ago
- Test suite to document the behavior of Spark ☆21 · Apr 15, 2021 · Updated 4 years ago
- ☆11 · Oct 11, 2022 · Updated 3 years ago
- Material for the Jupytext+Papermill blog post ☆31 · Jun 30, 2020 · Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Oct 18, 2023 · Updated 2 years ago
- A Gentle introduction to Machine Learning with Apache Spark ☆11 · Mar 2, 2026 · Updated 3 weeks ago
- Extracting LinkedIn comments from any post and export it to Excel file ☆23 · Oct 17, 2018 · Updated 7 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka ☆27 · Apr 30, 2025 · Updated 10 months ago
- A Spark datasource for the HadoopOffice library ☆36 · Sep 29, 2025 · Updated 6 months ago
- ☆12 · Oct 16, 2023 · Updated 2 years ago
- Ambari service for RedHat FreeIPA ☆11 · Sep 30, 2016 · Updated 9 years ago
- List of playbooks to manage Ambari ☆13 · Oct 3, 2018 · Updated 7 years ago
- Sample demonstrating consuming Amazon Cognito Streams ☆10 · Jun 15, 2020 · Updated 5 years ago
- A pyspark lib to validate data quality ☆18 · Nov 11, 2022 · Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w… ☆16 · Oct 3, 2025 · Updated 5 months ago
- API REST boilerplate using Spring Boot and Redis as database ☆13 · Dec 26, 2018 · Updated 7 years ago
- Packer Template to build a AWS Apache Cassandra AMI ☆10 · Jan 3, 2022 · Updated 4 years ago
- Sample Docker Compose files for running Apache Ambari ☆10 · Oct 29, 2018 · Updated 7 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an… ☆12 · Dec 29, 2021 · Updated 4 years ago
- An example application to integrate Amazon API Gateway and Amazon Lambda. ☆12 · Aug 5, 2015 · Updated 10 years ago
- ☆12 · Apr 27, 2018 · Updated 7 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2 ☆12 · Oct 22, 2021 · Updated 4 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 · Apr 5, 2017 · Updated 8 years ago
- Adds a framework to enable Natural Language interactions in your Hubot scripts ☆11 · Dec 6, 2016 · Updated 9 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆48 · Jan 7, 2025 · Updated last year
- BlockChain DApp using Angular ☆10 · Sep 24, 2018 · Updated 7 years ago
- Hi Spring fans! Welcome to a quick, mid-interregnum installment of Spring Tips in which we look at a few features that let you be both la… ☆13 · Mar 14, 2019 · Updated 7 years ago
- Thoughts on things I find interesting. ☆17 · Dec 19, 2024 · Updated last year
- Redash bootstrap script for CentOS. ☆16 · Feb 10, 2018 · Updated 8 years ago
- ☆10 · Feb 5, 2017 · Updated 9 years ago
- A set of modules aimed to manipulate policies on Apache Ranger. ☆13 · Jan 21, 2019 · Updated 7 years ago
- Spark Structured Streaming JDBC Sink ☆16 · Apr 26, 2021 · Updated 4 years ago
- ☆23 · Oct 3, 2024 · Updated last year
- Packer Template to build a AWS Apache Zookeeper AMI ☆14 · Jan 3, 2022 · Updated 4 years ago