datitran/emr-bootstrap-pyspark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/datitran/emr-bootstrap-pyspark)

datitran / emr-bootstrap-pyspark

Quickstart PySpark with Anaconda on AWS/EMR

☆53

Alternatives and similar repositories for emr-bootstrap-pyspark

Users that are interested in emr-bootstrap-pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

crawles / text-analytics-service-example
View on GitHub
Deploy sentiment analysis using Flask
☆18Oct 27, 2019Updated 6 years ago
aws-samples / dbtgluenyctaxidemo
View on GitHub
☆11Oct 11, 2022Updated 3 years ago
datitran / spark-tdd-example
View on GitHub
A simple Spark TDD example
☆26Sep 19, 2017Updated 8 years ago
lazyprogrammer / matlab-probability-class
View on GitHub
Resources and Materials for MATLAB Probability class
☆10Oct 23, 2015Updated 10 years ago
dimajix / docker-jupyter-spark
View on GitHub
Docker image for Jupyter notebooks with PySpark
☆28Aug 3, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
PorkShoulderHolder / graphiteVR
View on GitHub
Visualizing twitter data in a collaborative VR environment
☆10Apr 8, 2016Updated 10 years ago
awsdocs / amazon-cloudsearch-developer-guide
View on GitHub
Content for the Amazon CloudSearch Developer Guide
☆10Jun 15, 2023Updated 3 years ago
awsdocs / application-auto-scaling-user-guide
View on GitHub
The open source version of the Application Auto Scaling User Guide. To submit feedback or requests for changes, submit an issue or make c…
☆11Jun 15, 2023Updated 3 years ago
awskrug / cli-group
View on GitHub
AWSKRUG CLI Small Group
☆10Jan 9, 2020Updated 6 years ago
thomhopmans / pythom
View on GitHub
Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl
☆71Mar 17, 2023Updated 3 years ago
awsdocs / aws-client-vpn-administrator-guide
View on GitHub
The open source version of the AWS Client VPN Administrator Guide. To submit feedback or requests for changes, submit an issue or make ch…
☆13Jun 15, 2023Updated 3 years ago
idealo / terraform-emr-pyspark
View on GitHub
Quickstart PySpark with Anaconda on AWS/EMR using Terraform
☆48Jan 7, 2025Updated last year
awsdocs / aws-toolkit-vs-code-user-guide
View on GitHub
The open source version of the AWS Toolkit for Visual Studio Code user guide. You can submit feedback & requests for changes by submittin…
☆14Jun 16, 2023Updated 3 years ago
Jcharis / streamlit_todo_crud_app
View on GitHub
Streamlit ToDo CRUD App
☆28Jun 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dennissergeev / exoconvection-apj-2020
View on GitHub
Code for Sergeev et al. (2020)
☆14Apr 15, 2023Updated 3 years ago
firmai / DeepLearningForTimeSeriesForecasting
View on GitHub
A tutorial demonstrating how to implement deep learning models for time series forecasting
☆12Jan 28, 2020Updated 6 years ago
awslabs / amazon-s3-tagging-spark-util
View on GitHub
☆12Oct 16, 2023Updated 2 years ago
awsdocs / amazon-s3-getting-started-guide
View on GitHub
This guide has been archived. Please see https://github.com/awsdocs/amazon-s3-userguide for an open source version of the Amazon S3 docs.…
☆20Jan 20, 2021Updated 5 years ago
newfront / spark-intro-to-ml
View on GitHub
A Gentle introduction to Machine Learning with Apache Spark
☆11Mar 2, 2026Updated 4 months ago
vatsan / dspcfboilerplate
View on GitHub
Boilerplate code for flask apps on PCF that interact with a backend environment (ex: Pivotal BDS or ElephantSQL).
☆12Sep 1, 2016Updated 9 years ago
joomcode / spark-platform
View on GitHub
Basic Spark utilities
☆13Updated this week
jmportilla / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
View on GitHub
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-firs…
☆11Jun 10, 2015Updated 11 years ago
jmportilla / SQL-Appendix
View on GitHub
Appendix
☆14Apr 16, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aws-samples / aws-concurrent-data-orchestration-pipeline-emr-livy
View on GitHub
This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…
☆76Oct 30, 2018Updated 7 years ago
jmportilla / urban-data-science
View on GitHub
Course materials, IPython notebooks, tutorials, and guides for the urban data science course
☆12Mar 27, 2016Updated 10 years ago
appsecco / docker-data-science-toolbox
View on GitHub
Data Science Command Line Toolbox in a docker container
☆31Jun 6, 2018Updated 8 years ago
jamartinh / Orange3-Spark
View on GitHub
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15Dec 24, 2016Updated 9 years ago
vmware-archive / tasa
View on GitHub
Topic and sentiment analysis of tweets (demo)
☆11Mar 21, 2019Updated 7 years ago
mark-hoffmann / fastteradata
View on GitHub
Tools for faster and optimized interaction with Teradata and large datasets.
☆17Jul 11, 2018Updated 8 years ago
Sharkaboi / DrawingsApp
View on GitHub
An app to add and manage floor plan drawings with markers.
☆28Jul 20, 2021Updated 5 years ago
jmportilla / DeepLearningTutorials
View on GitHub
Deep Learning Tutorial notes and code. See the wiki for more info.
☆10Jun 22, 2015Updated 11 years ago
yoonje / elastic-stack-tutorial
View on GitHub
ELK 튜토리얼
☆11Mar 15, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yrbszhsh / Principles-of-Artificial-Intelligence-Learning-Algorithms
View on GitHub
My work on UCSD CSE 250B Principles of Artificial Intelligence: Learning Algorithms
☆13Jul 24, 2019Updated 7 years ago
cerndb / sparkMeasure
View on GitHub
This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…
☆16May 21, 2026Updated 2 months ago
javieraviles / spring-boot-redis-rest
View on GitHub
API REST boilerplate using Spring Boot and Redis as database
☆13Dec 26, 2018Updated 7 years ago
nuxeo-archives / nuxeo-signature
View on GitHub
Digital signature addon for signing PDF files
☆10Apr 10, 2019Updated 7 years ago
dream2globe / SparkDefinitiveGuide
View on GitHub
The Example Codes of "Spark, The Definitive Guide"
☆12Nov 15, 2020Updated 5 years ago
awsdocs / elb-application-load-balancers-user-guide
View on GitHub
The open source version of the User Guide for Application Load Balancers. To submit feedback or requests for changes, submit an issue or …
☆26Jun 15, 2023Updated 3 years ago
divyam-rai / simple-kafka-sasl-docker-python
View on GitHub
Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…
☆12Dec 29, 2021Updated 4 years ago