This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
☆116Aug 8, 2024Updated last year
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below
Sorting:
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆32Jul 12, 2015Updated 10 years ago
- Course materials for Stat 20 and Stat 131A, Spring 2017, at UC Berkeley☆17May 21, 2017Updated 8 years ago
- Hands-on examples showcasing popular NLP applications☆19Aug 23, 2019Updated 6 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated last month
- Slides, material and solutions of the popular Statistical Learning course from Stanford's own Hastie & Tibshirani. Join me on my journey …☆16Mar 9, 2018Updated 8 years ago
- ☆20Aug 20, 2016Updated 9 years ago
- The art of effective visualization of multi-dimensional data☆166Oct 7, 2018Updated 7 years ago
- Recipe for Spanish POS tagging using the CESS corpus with NLTK☆18Sep 28, 2016Updated 9 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Sep 26, 2016Updated 9 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆14Dec 15, 2019Updated 6 years ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- ☆12Sep 20, 2016Updated 9 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn"☆29Jan 21, 2016Updated 10 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- ☆12Sep 4, 2017Updated 8 years ago
- A command line app to compare users on different platforms.☆12May 21, 2017Updated 8 years ago
- Performance Benchmarks☆21Oct 24, 2024Updated last year
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 8 years ago
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 10 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Sep 12, 2013Updated 12 years ago
- ☆37May 27, 2025Updated 9 months ago
- Matlab implementation of TCK☆12Jul 5, 2019Updated 6 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- Flappy Bird Automation using RL and Servo☆30Sep 2, 2016Updated 9 years ago
- An offline IDE for C++, although similar to ideone.com, but ensures that your code doesn't fall into wrong hands :p☆16Feb 18, 2016Updated 10 years ago
- Course materials for Stat 133, fall 2016, at UC Berkeley☆15Feb 16, 2017Updated 9 years ago
- Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! Thi…☆1,691Dec 24, 2020Updated 5 years ago
- A PHP library to create real-time web applications without expensive server and websocket☆10Aug 5, 2016Updated 9 years ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆116Oct 21, 2021Updated 4 years ago
- My competitions approach☆18Jan 13, 2022Updated 4 years ago
- A basic introduction to machine learning (one day training).☆16Nov 23, 2017Updated 8 years ago
- EDA Tutorial for 2017 PyCon Portland☆13May 2, 2017Updated 8 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- Notes and code for learning Random Forests☆12Nov 17, 2022Updated 3 years ago
- Scripts for capturing tweets, creating data dictionary, processing & scoring tweet sentiments☆11Aug 24, 2015Updated 10 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Sep 16, 2014Updated 11 years ago
- Hands-On-Predictive-Analytics-with-Python☆15Jan 15, 2021Updated 5 years ago
- Multi-armed bandits for dynamic movie recommendations☆14Nov 20, 2019Updated 6 years ago