bdanalytics / Berkeley-SparkLinks
edXSpark
☆21Updated 9 years ago
Alternatives and similar repositories for Berkeley-Spark
Users that are interested in Berkeley-Spark are comparing it to the libraries listed below
Sorting:
- A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.☆27Updated 14 years ago
- Cheatsheet for Spark DataFrame☆91Updated 6 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- Spark 2.0 Scala Machine Learning examples☆78Updated 6 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆68Updated 10 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆32Updated 10 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆48Updated 10 years ago
- Code examples and docker environment for Spark☆28Updated 9 years ago
- Installation guide for Apache Spark + Hadoop on Mac/Linux☆60Updated 8 years ago
- Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational f…☆168Updated 5 months ago
- Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL☆211Updated 3 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 10 years ago
- Demo notebooks inside a docker for end-to-end examples☆112Updated 7 years ago
- All materials for workshops - HackOn(Data) - Toronto☆33Updated 8 years ago
- Scala for Statistical Computing and Data Science Short Course☆136Updated 5 years ago
- Source material for Data Science for Telecom Tutorial at Strata Singapore 2015☆102Updated 9 years ago
- ☆12Updated 9 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆116Updated last year
- Learn the pyspark API through pictures and simple examples☆170Updated 5 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆346Updated 4 years ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆166Updated 7 years ago
- Analyzing NBA data using Spark 2.1☆47Updated 9 years ago
- A short course on the new, experimental features by The Data Incubator and O'Reilly Strata.☆16Updated 9 years ago
- Code from the book Machine Learning Systems☆145Updated 7 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 7 years ago
- An introduction to implementing a number of scikit-learn classifiers, along with some data exploration☆102Updated 10 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆78Updated 7 years ago