PiercingDan/spark-Jupyter-AWS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PiercingDan/spark-Jupyter-AWS)

PiercingDan / spark-Jupyter-AWS

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

☆260

Alternatives and similar repositories for spark-Jupyter-AWS

Users that are interested in spark-Jupyter-AWS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nchammas / flintrock
View on GitHub
A command-line tool for launching Apache Spark clusters.
☆651Dec 13, 2024Updated last year
soumith / mltrain-nips-2017
View on GitHub
This repository contains all the material for the MLTrain NIPS workshop
☆10Dec 9, 2017Updated 8 years ago
freeradical13 / ValueBasedPrioritization
View on GitHub
☆12Oct 13, 2020Updated 5 years ago
unnati-xyz / scalable-data-science-platform
View on GitHub
Content for architecting a data science platform for products using Luigi, Spark & Flask.
☆160Jan 27, 2020Updated 6 years ago
jmportilla / SQL-Appendix
View on GitHub
Appendix
☆14Apr 16, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
knathanieltucker / tf-keras-tutorial
View on GitHub
An introduction to tensorflow via keras
☆25Dec 14, 2017Updated 8 years ago
joelgrus / fun-with-trump-tweets
View on GitHub
code for Seattle Twitter-Dev Meetup, October 2016
☆13Oct 26, 2016Updated 9 years ago
IDEO-coLAB / machine-learning-resources
View on GitHub
Machine learning resources
☆13Feb 1, 2018Updated 8 years ago
maciejkula / binge
View on GitHub
Recommendation models that use binary rather than floating point operations at prediction time.
☆21Sep 18, 2017Updated 8 years ago
jwplayer / sparksteps
View on GitHub
CLI tool to launch Spark jobs on AWS EMR
☆67Oct 18, 2023Updated 2 years ago
amplab / spark-ec2
View on GitHub
Scripts used to setup a Spark cluster on EC2
☆388Nov 22, 2017Updated 8 years ago
jostmey / NakedTensor
View on GitHub
Bare bone examples of machine learning in TensorFlow
☆2,404Mar 14, 2017Updated 9 years ago
MrPowers / spark-spec
View on GitHub
Test suite to document the behavior of Spark
☆21Apr 15, 2021Updated 5 years ago
pythian / spark_streaming_percentile
View on GitHub
This is the repository for my blog post on calculating percentile on a streaming dataset using spark streaming.
☆10Nov 21, 2015Updated 10 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
msukmanowsky / pyspark-testing
View on GitHub
Unit and integration testing with PySpark can be tough to figure out, let's make that easier.
☆23Nov 3, 2015Updated 10 years ago
jvns / forestspy
View on GitHub
spy on your random forests
☆19Aug 20, 2020Updated 5 years ago
minrk / findspark
View on GitHub
☆525Mar 1, 2026Updated 4 months ago
logicx24 / DailyCalHousingAnalysis
View on GitHub
☆21Nov 2, 2016Updated 9 years ago
wroscoe / notebooks
View on GitHub
My computational narrative notebooks.
☆10Aug 13, 2018Updated 7 years ago
rasbt / PyMLSlides
View on GitHub
Slides for my machine learning course based on Sebastian Raschka's Python Machine Learning book
☆15Jun 22, 2018Updated 8 years ago
plaitpy / plaitpy
View on GitHub
plait.py - a fake data modeler
☆437Dec 27, 2018Updated 7 years ago
rlhotovy / lambda-numba
View on GitHub
A small demo showing how to compile Numba kernels for use on AWS Lambda
☆13Nov 7, 2016Updated 9 years ago
zachcp / simplecomponent
View on GitHub
a simple component to show use of D3 from Reagent
☆14Dec 21, 2015Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kelproject / kel-identity
View on GitHub
centralized authentication/authorization for Kel
☆11Aug 23, 2016Updated 9 years ago
josteink / autoarchiver
View on GitHub
A simple system for archiving and OCRing documents built for cloud-friendly search and backup.
☆23Dec 9, 2020Updated 5 years ago
ActivisionGameScience / python-kafka-benchmark
View on GitHub
☆15Jun 15, 2016Updated 10 years ago
GalvanizeDataScience / building-spark-applications-live-lessons
View on GitHub
Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…
☆68Jan 8, 2016Updated 10 years ago
astorfi / TensorFlow-World
View on GitHub
Simple and ready-to-use tutorials for TensorFlow
☆4,492Dec 23, 2020Updated 5 years ago
airbnb / knowledge-repo
View on GitHub
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
☆5,539Sep 4, 2024Updated last year
reiinakano / xcessiv
View on GitHub
A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.
☆1,265Jun 6, 2018Updated 8 years ago
reiinakano / scikit-plot
View on GitHub
An intuitive library to add plotting functionality to scikit-learn objects.
☆2,433Aug 20, 2024Updated last year
facebookarchive / bootstrapped
View on GitHub
Generate bootstrapped confidence intervals for A/B testing in Python.
☆637Nov 11, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
oscaro / maximator
View on GitHub
Thin Clojure wrapper around MaxMind GeoIP2 for IP geolocalization
☆15Jul 6, 2023Updated 3 years ago
movchan74 / tensorflow_serving_examples
View on GitHub
☆12May 11, 2018Updated 8 years ago
dziganto / dziganto.github.io
View on GitHub
☆25Jun 25, 2018Updated 8 years ago
dodger487 / dplython
View on GitHub
dplyr for python
☆761Dec 30, 2016Updated 9 years ago
agrawal-priyank / machine-learning-clustering-retrieval
View on GitHub
Built text and image clustering models using unsupervised machine learning algorithms such as nearest neighbors, k means, LDA , and used …
☆19Jan 20, 2018Updated 8 years ago
donnemartin / data-science-ipython-notebooks
View on GitHub
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…
☆29,259Mar 20, 2024Updated 2 years ago
rhiever / datacleaner
View on GitHub
A Python tool that automatically cleans data sets and readies them for analysis.
☆1,082May 22, 2019Updated 7 years ago