This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
☆115Aug 8, 2024Updated last year
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆31Jul 12, 2015Updated 10 years ago
- Course materials for Stat 20 and Stat 131A, Spring 2017, at UC Berkeley☆17May 21, 2017Updated 9 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 4 months ago
- Slides, material and solutions of the popular Statistical Learning course from Stanford's own Hastie & Tibshirani. Join me on my journey …☆16Mar 9, 2018Updated 8 years ago
- Code repository for Fast Data Processing Systems with SMACK Stack by Packt☆18Jan 18, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆20Aug 20, 2016Updated 9 years ago
- The art of effective visualization of multi-dimensional data☆167Oct 7, 2018Updated 7 years ago
- Course materials for Stat 154, spring 2018, at UC Berkeley☆26Nov 15, 2018Updated 7 years ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Sep 26, 2016Updated 9 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆15Dec 15, 2019Updated 6 years ago
- Spark Projects for the Berkeley Data Science Course☆13Aug 12, 2015Updated 10 years ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn"☆29Jan 21, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- This repository contains code snippets discussed in 15-440, lecture 5 (given on 1/28/2014).☆11Jan 30, 2014Updated 12 years ago
- ☆12Sep 4, 2017Updated 8 years ago
- A curated list of Ionic Framework resources, components, libraries, and snippets.☆15May 8, 2017Updated 9 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 9 years ago
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 11 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Sep 12, 2013Updated 12 years ago
- ☆38May 27, 2025Updated last year
- Repository for Computer Architecture Class at UC Berkeley☆10Nov 25, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Learn how to build NPL Cognitive Chatbots☆25Jan 28, 2020Updated 6 years ago
- Additional useful algorithms that can be used with spark.☆12Feb 2, 2015Updated 11 years ago
- Matlab implementation of TCK☆12Jul 5, 2019Updated 6 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- ☆18Sep 14, 2019Updated 6 years ago
- Course materials for Stat 133, fall 2016, at UC Berkeley☆15Feb 16, 2017Updated 9 years ago
- Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! Thi…☆1,694Dec 24, 2020Updated 5 years ago
- A PHP library to create real-time web applications without expensive server and websocket☆11Aug 5, 2016Updated 9 years ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆115Oct 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- My competitions approach☆18Jan 13, 2022Updated 4 years ago
- Probability and Statistics in Data Science [Python] [Complete]☆11Oct 22, 2018Updated 7 years ago
- EDA Tutorial for 2017 PyCon Portland☆13May 2, 2017Updated 9 years ago
- A basic introduction to machine learning (one day training).☆16Nov 23, 2017Updated 8 years ago
- The repository for the CMU Data Pipeline course. This year's course should use branch 2017☆40May 2, 2017Updated 9 years ago
- Python Tips for Data Scientist☆27Feb 19, 2022Updated 4 years ago
- Scripts for capturing tweets, creating data dictionary, processing & scoring tweet sentiments☆11Aug 24, 2015Updated 10 years ago