This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX.
☆116Aug 8, 2024Updated last year
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
Users who are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the repositories listed below.
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆32Jul 12, 2015Updated 10 years ago
- Course materials for Stat 20 and Stat 131A, Spring 2017, at UC Berkeley☆17May 21, 2017Updated 8 years ago
- Hands-on examples showcasing popular NLP applications☆19Aug 23, 2019Updated 6 years ago
- ☆20Aug 20, 2016Updated 9 years ago
- The art of effective visualization of multi-dimensional data☆166Oct 7, 2018Updated 7 years ago
- Course materials for Stat 154, spring 2018, at UC Berkeley☆27Nov 15, 2018Updated 7 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Sep 26, 2016Updated 9 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆15Dec 15, 2019Updated 6 years ago
- Spark Projects for the Berkeley Data Science Course☆13Aug 12, 2015Updated 10 years ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- ☆12Sep 20, 2016Updated 9 years ago
- ☆11May 8, 2016Updated 9 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn"☆29Jan 21, 2016Updated 10 years ago
- ☆12Sep 4, 2017Updated 8 years ago
- A command line app to compare users on different platforms.☆12May 21, 2017Updated 8 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 8 years ago
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 10 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Sep 12, 2013Updated 12 years ago
- Materials for my SciPy2013 tutorial on NumPy and IPython☆33Aug 19, 2013Updated 12 years ago
- Matlab implementation of TCK☆12Jul 5, 2019Updated 6 years ago
- Flappy Bird Automation using RL and Servo☆30Sep 2, 2016Updated 9 years ago
- An offline IDE for C++, although similar to ideone.com, but ensures that your code doesn't fall into wrong hands :p☆16Feb 18, 2016Updated 10 years ago
- Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! Thi…☆1,691Dec 24, 2020Updated 5 years ago
- My competitions approach☆18Jan 13, 2022Updated 4 years ago
- Cheat Sheets☆11Aug 31, 2019Updated 6 years ago
- ☆10Feb 13, 2024Updated 2 years ago
- This repository holds all course materials for the fall 2014 offering of Statistics 243 at UC Berkeley.☆15Sep 9, 2015Updated 10 years ago
- Probability and Statistics in Data Science [Python] [Complete]☆11Oct 22, 2018Updated 7 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- This is the code for "What is a Blockchain Smart Contract?" by Siraj Raval on Youtube☆25Oct 23, 2017Updated 8 years ago
- Notes and code for learning Random Forests☆12Nov 17, 2022Updated 3 years ago
- Scripts for capturing tweets, creating data dictionary, processing & scoring tweet sentiments☆11Aug 24, 2015Updated 10 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Sep 16, 2014Updated 11 years ago
- This repository holds all course materials for the fall 2018 offering of Statistics 243 at UC Berkeley.☆17Sep 5, 2019Updated 6 years ago
- Multi-armed bandits for dynamic movie recommendations☆14Nov 20, 2019Updated 6 years ago
- Course of Machine Learning in Science and Industry at Heidelberg university☆47Apr 15, 2017Updated 8 years ago
- Repository for the PyData DC 2016 tutorial☆29Nov 12, 2016Updated 9 years ago
- A chrome extension to get the meaning of the selected word instantly.☆20Feb 27, 2018Updated 8 years ago
- Example of building a data mining pipeline, based on the Dahang Cup (大航杯) "智造扬中" electric power AI competition☆15Jun 13, 2017Updated 8 years ago