Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
☆68Jan 8, 2016Updated 10 years ago
Alternatives and similar repositories for building-spark-applications-live-lessons
Users that are interested in building-spark-applications-live-lessons are comparing it to the libraries listed below
Sorting:
- Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/de…☆10Jun 22, 2020Updated 5 years ago
- Bokeh tutorial, PyData Berlin☆10May 29, 2015Updated 10 years ago
- Materials for "Teaching the Tidyverse" January 2019 edition☆27Mar 6, 2019Updated 6 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- repository for code related to the end-to-end data analysis in python workshop, from the Open Data Science Conference 2015☆15Nov 8, 2015Updated 10 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- In this sample i integrate a play framework app (java) with akka cluster so that you can easily add new play node to scale your system. w…☆23Jul 20, 2015Updated 10 years ago
- Python bindings for Stanford CoreNLP's protobufs.☆20Jul 23, 2018Updated 7 years ago
- Repository for code/examples/instructions for the MIT course 15.S60 "Software Tools for Operations Research"☆25Aug 14, 2014Updated 11 years ago
- Slides, notes, and code for KotlinConf talk on Data Science☆19Mar 5, 2018Updated 8 years ago
- ☆24Feb 27, 2018Updated 8 years ago
- Python data analysis course for 2017 NGCM Summer Academy☆21Jun 28, 2017Updated 8 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 15 years ago
- Data Exploration with Apache Drill☆26Mar 5, 2020Updated 6 years ago
- ☆14Apr 6, 2023Updated 2 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Apr 10, 2024Updated last year
- A full-stack AI chat application powered by Amazon Bedrock and Strands Agents☆46Feb 14, 2026Updated 2 weeks ago
- Homework submissions for Harvard's CS171: Visualization☆16Mar 30, 2015Updated 10 years ago
- Visualize statistics from the MOOC "Functional Programming Principles in Scala" using Scala!☆202Mar 31, 2014Updated 11 years ago
- Apache Zeppelin on Kubernetes.☆28Apr 23, 2019Updated 6 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆26Oct 16, 2018Updated 7 years ago
- "Reactive Akka" presentation☆30Aug 19, 2015Updated 10 years ago
- python notebooks for revising machine learning basics☆28Jun 26, 2018Updated 7 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆29May 31, 2018Updated 7 years ago
- Spark GCE Script Helps you deploy Spark cluster on Google Cloud.☆43May 30, 2015Updated 10 years ago
- Contains Python code for my Imbalanced-Classification training course!☆37Jan 6, 2021Updated 5 years ago
- Notebooks containing R code from Richard McElreath's Statistical Rethinking☆72Feb 15, 2016Updated 10 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Jun 17, 2012Updated 13 years ago
- Scala: The Unpredicted Lingua Franca for Data Science☆129Dec 14, 2018Updated 7 years ago
- VariantSpark is a framework for applying Spark-based Machine Learning methods to whole-genome variant information☆33Sep 28, 2017Updated 8 years ago
- Relief Based Algorithms of ReBATE implemented in Python with Cython optimization. This repository is no longer being updated. Please see…☆33May 22, 2018Updated 7 years ago
- datasets from my cyber security research papers☆10Jan 12, 2021Updated 5 years ago
- Small utility that loads any downloaded JSON databases from www.phishtank.com into Redis cache for quick local queries☆11Aug 8, 2016Updated 9 years ago
- Materials for my PyData Boston 2013 talk☆15Sep 26, 2013Updated 12 years ago
- algorithm study☆13Updated this week
- ☆11Jan 4, 2017Updated 9 years ago
- GUI to handle job queues in Abaqus☆14Nov 15, 2018Updated 7 years ago
- ☆12Nov 17, 2020Updated 5 years ago
- Face detection with Python☆11Aug 8, 2019Updated 6 years ago