Quickstart PySpark with Anaconda on AWS/EMR
☆53 · Jan 9, 2017 · Updated 9 years ago
Alternatives and similar repositories for emr-bootstrap-pyspark
Users that are interested in emr-bootstrap-pyspark are comparing it to the libraries listed below.
- A toolset to streamline running spark python on EMR ☆20 · Nov 16, 2016 · Updated 9 years ago
- Resources and Materials for MATLAB Probability class ☆10 · Oct 23, 2015 · Updated 10 years ago
- Access Parse.ly raw data via Amazon S3 and Kinesis ☆11 · Jan 27, 2023 · Updated 3 years ago
- Code for my paper "Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression" ☆11 · Sep 15, 2013 · Updated 12 years ago
- The open source version of the Application Auto Scaling User Guide. To submit feedback or requests for changes, submit an issue or make c… ☆11 · Jun 15, 2023 · Updated 3 years ago
- Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can conta… ☆24 · Oct 18, 2019 · Updated 6 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl ☆71 · Mar 17, 2023 · Updated 3 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆48 · Jan 7, 2025 · Updated last year
- ☆15 · Feb 12, 2019 · Updated 7 years ago
- ☆10 · May 11, 2019 · Updated 7 years ago
- Toolkit for Visual Studio is a plugin for the Visual Studio IDE. ☆10 · Jun 15, 2023 · Updated 3 years ago
- Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + Kubernetes + Airflow + Jupyter ☆11 · Oct 28, 2022 · Updated 3 years ago
- A tutorial demonstrating how to implement deep learning models for time series forecasting ☆12 · Jan 28, 2020 · Updated 6 years ago
- Streamlit ToDo CRUD App ☆28 · Jun 25, 2024 · Updated 2 years ago
- An R package to gather, munge, and convert event datasets into temporal event-networks. ☆11 · Mar 28, 2018 · Updated 8 years ago
- Boilerplate code for flask apps on PCF that interact with a backend environment (ex: Pivotal BDS or ElephantSQL). ☆12 · Sep 1, 2016 · Updated 9 years ago
- aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-firs… ☆11 · Jun 10, 2015 · Updated 11 years ago
- An opinionated CLI tool for Python monorepo MGMT (Work in Progress) ☆26 · Jul 6, 2020 · Updated 5 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆76 · Oct 30, 2018 · Updated 7 years ago
- Extracting LinkedIn comments from any post and export it to Excel file ☆23 · Oct 17, 2018 · Updated 7 years ago
- csvcat ☆22 · Feb 23, 2016 · Updated 10 years ago
- An Introduction to Reinforcement Learning ☆16 · Nov 30, 2017 · Updated 8 years ago
- Course materials, IPython notebooks, tutorials, and guides for the urban data science course ☆12 · Mar 27, 2016 · Updated 10 years ago
- User-friendly Teradata client for Python ☆107 · Nov 17, 2021 · Updated 4 years ago
- In-database parallel grid-search for XGBoost on Greenplum ☆15 · Mar 1, 2018 · Updated 8 years ago
- Appendix ☆14 · Apr 16, 2015 · Updated 11 years ago
- Basic Spark utilities ☆13 · Feb 20, 2025 · Updated last year
- Deep Learning Tutorial notes and code. See the wiki for more info. ☆10 · Jun 22, 2015 · Updated 11 years ago
- My work on UCSD CSE 250B Principles of Artificial Intelligence: Learning Algorithms ☆13 · Jul 24, 2019 · Updated 6 years ago
- Quantitative analysis for traders on Oslo Stock Exchange. Download, plot and play with data from Oslo Børs and Nasdaq OMX ☆10 · Jul 28, 2018 · Updated 7 years ago
- A pyspark lib to validate data quality ☆19 · Nov 11, 2022 · Updated 3 years ago
- API REST boilerplate using Spring Boot and Redis as database ☆13 · Dec 26, 2018 · Updated 7 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an… ☆12 · Dec 29, 2021 · Updated 4 years ago
- Lab for Jagiellonian University course ☆10 · Jul 2, 2016 · Updated 9 years ago
- A PyTorch implementation of DenseNet, supporting multiclass and multilabel classification. ☆24 · Aug 11, 2017 · Updated 8 years ago
- ☆12 · Feb 19, 2020 · Updated 6 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 · Apr 5, 2017 · Updated 9 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 · Jul 11, 2018 · Updated 7 years ago
- ☆22 · Jun 19, 2020 · Updated 6 years ago