This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX.
☆116Aug 8, 2024Updated last year
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below.
- This repository contains code files, specifically IPython notebooks, for the assignments in the course "Scalable Machine Learning" by UC Be…☆32Jul 12, 2015Updated 10 years ago
- Course materials for Stat 20 and Stat 131A, Spring 2017, at UC Berkeley☆17May 21, 2017Updated 8 years ago
- Hands-on examples showcasing popular NLP applications☆19Aug 23, 2019Updated 6 years ago
- Slides, material and solutions of the popular Statistical Learning course from Stanford's own Hastie & Tibshirani. Join me on my journey …☆16Mar 9, 2018Updated 8 years ago
- ☆20Aug 20, 2016Updated 9 years ago
- The art of effective visualization of multi-dimensional data☆166Oct 7, 2018Updated 7 years ago
- Recipe for Spanish POS tagging using the CESS corpus with NLTK☆18Sep 28, 2016Updated 9 years ago
- Course materials for Stat 154, spring 2018, at UC Berkeley☆27Nov 15, 2018Updated 7 years ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch sample lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆15Dec 15, 2019Updated 6 years ago
- Spark Projects for the Berkeley Data Science Course☆13Aug 12, 2015Updated 10 years ago
- A repository to keep all open source projects created by individuals or study groups.☆25Nov 22, 2022Updated 3 years ago
- ☆12Sep 20, 2016Updated 9 years ago
- ☆11May 8, 2016Updated 9 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn"☆29Jan 21, 2016Updated 10 years ago
- A curated list of Ionic Framework resources, components, libraries, and snippets.☆15May 8, 2017Updated 8 years ago
- A command line app to compare users on different platforms.☆12May 21, 2017Updated 8 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 9 years ago
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 10 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Sep 12, 2013Updated 12 years ago
- Materials for my SciPy2013 tutorial on NumPy and IPython☆33Aug 19, 2013Updated 12 years ago
- ☆37May 27, 2025Updated 11 months ago
- Repository for Computer Architecture Class at UC Berkeley☆10Nov 25, 2019Updated 6 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- Flappy Bird Automation using RL and Servo☆30Sep 2, 2016Updated 9 years ago
- An offline IDE for C++, similar to ideone.com, that ensures your code doesn't fall into the wrong hands :p☆16Feb 18, 2016Updated 10 years ago
- Course materials for Stat 133, fall 2016, at UC Berkeley☆15Feb 16, 2017Updated 9 years ago
- Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! Thi…☆1,694Dec 24, 2020Updated 5 years ago
- A PHP library to create real-time web applications without expensive server and websocket☆10Aug 5, 2016Updated 9 years ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆116Oct 21, 2021Updated 4 years ago
- My competitions approach☆18Jan 13, 2022Updated 4 years ago
- Notes for the courses in the Machine Learning Specialization created by the University of Washington on Coursera☆45Dec 29, 2020Updated 5 years ago
- Probability and Statistics in Data Science [Python] [Complete]☆11Oct 22, 2018Updated 7 years ago
- EDA Tutorial for 2017 PyCon Portland☆13May 2, 2017Updated 8 years ago
- UCSD Big Data Specialization General Materials and my Capstone Project.☆21Apr 6, 2018Updated 8 years ago
- A basic introduction to machine learning (one day training).☆16Nov 23, 2017Updated 8 years ago
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- ☆40Sep 3, 2015Updated 10 years ago