Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
☆68Jan 8, 2016Updated 10 years ago
Alternatives and similar repositories for building-spark-applications-live-lessons
Users that are interested in building-spark-applications-live-lessons are comparing it to the libraries listed below.
- Materials for "Teaching the Tidyverse" January 2019 edition☆27Mar 6, 2019Updated 7 years ago
- Bokeh tutorial, PyData Berlin☆10May 29, 2015Updated 11 years ago
- Spark Training Exercises☆25May 11, 2016Updated 10 years ago
- Repository for code related to the end-to-end data analysis in Python workshop, from the Open Data Science Conference 2015☆15Nov 8, 2015Updated 10 years ago
- ☆24Feb 27, 2018Updated 8 years ago
- ☆11Apr 18, 2019Updated 7 years ago
- In this sample I integrate a Play Framework app (Java) with Akka Cluster so that you can easily add new Play nodes to scale your system. w…☆22Jul 20, 2015Updated 10 years ago
- Coding exercises for Apache Spark☆103Jun 4, 2015Updated 11 years ago
- Repository for code/examples/instructions for the MIT course 15.S60 "Software Tools for Operations Research"☆25Aug 14, 2014Updated 11 years ago
- AWS Big Data Certification☆25Mar 26, 2026Updated 3 months ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆115Oct 21, 2021Updated 4 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Feb 17, 2011Updated 15 years ago
- Python data analysis course for 2017 NGCM Summer Academy☆21Jun 28, 2017Updated 9 years ago
- SmallK: very fast data clustering tools☆13Apr 3, 2019Updated 7 years ago
- Hands-On-Predictive-Analytics-with-Python☆15Jan 15, 2021Updated 5 years ago
- Python bindings for Stanford CoreNLP's protobufs.☆20Jul 23, 2018Updated 7 years ago
- Doing the tutorial from Brandon Rhodes PyCon 2015☆16Jun 19, 2015Updated 11 years ago
- ☆10Jun 28, 2015Updated 11 years ago
- C/C++ Algorithms Implementation for Code In☆14Nov 15, 2015Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52May 13, 2016Updated 10 years ago
- Recipe for running a docker registry inside Kubernetes☆11Jan 2, 2017Updated 9 years ago
- Use Vagrant to manage your EC2 and VPC instances.☆10May 10, 2016Updated 10 years ago
- Docker files for the example code in Big Data for Chimps☆20May 19, 2015Updated 11 years ago
- Documentation on using the built-in Python debugger, PDB.☆24Dec 8, 2022Updated 3 years ago
- Groovy client library for Apache Ambari's REST API☆20Jun 25, 2021Updated 5 years ago
- Contains Python code for my Imbalanced-Classification training course!☆37Jan 6, 2021Updated 5 years ago
- A list of all projects by UW CSE students.☆10Feb 8, 2016Updated 10 years ago
- CommonRegex port for Java☆20Jan 18, 2014Updated 12 years ago
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"☆11Jan 5, 2014Updated 12 years ago
- "Reactive Akka" presentation☆30Aug 19, 2015Updated 10 years ago
- User behavior analysis system☆12Dec 10, 2015Updated 10 years ago
- Apache Solr TextField with docValues support☆11Mar 24, 2022Updated 4 years ago
- Materials for the Functional Python tutorial at PyData-NYC 2013☆48Mar 28, 2014Updated 12 years ago
- Slides, notes, and code for KotlinConf talk on Data Science☆19Mar 5, 2018Updated 8 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Sep 26, 2016Updated 9 years ago
- ☆19Feb 16, 2024Updated 2 years ago
- argo-cron☆14Feb 17, 2020Updated 6 years ago
- A Play-based server implementing the PSI machine learning API☆14Dec 16, 2013Updated 12 years ago
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 6 years ago