This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
☆115Aug 8, 2024Updated last year
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆31Jul 12, 2015Updated 10 years ago
- Hands-on examples showcasing popular NLP applications☆19Aug 23, 2019Updated 6 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 4 months ago
- Code repository for Fast Data Processing Systems with SMACK Stack by Packt☆18Jan 18, 2023Updated 3 years ago
- ☆20Aug 20, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Recipe for Spanish POS tagging using the CESS corpus with NLTK☆18Sep 28, 2016Updated 9 years ago
- Course materials for Stat 154, spring 2018, at UC Berkeley☆26Nov 15, 2018Updated 7 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Sep 26, 2016Updated 9 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆15Dec 15, 2019Updated 6 years ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- Material for Machine Learning Meetup "Machine Learning with Scikit-learn"☆29Jan 21, 2016Updated 10 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 9 years ago
- Performance Benchmarks☆21Oct 24, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 11 years ago
- Materials for my SciPy2013 tutorial on NumPy and IPython☆33Aug 19, 2013Updated 12 years ago
- Learn how to build NPL Cognitive Chatbots☆25Jan 28, 2020Updated 6 years ago
- Matlab implementation of TCK☆12Jul 5, 2019Updated 6 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- Course materials for Stat 133, fall 2016, at UC Berkeley☆15Feb 16, 2017Updated 9 years ago
- Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! Thi…☆1,694Dec 24, 2020Updated 5 years ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆116Oct 21, 2021Updated 4 years ago
- ☆10Feb 13, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Cheat Sheets☆11Aug 31, 2019Updated 6 years ago
- This repository holds all course materials for the fall 2014 offering of Statistics 243 at UC Berkeley.☆15Sep 9, 2015Updated 10 years ago
- EDA Tutorial for 2017 PyCon Portland☆13May 2, 2017Updated 9 years ago
- ☆40Sep 3, 2015Updated 10 years ago
- Notes and code for learning Random Forests☆13Nov 17, 2022Updated 3 years ago
- Scripts for capturing tweets, creating data dictionary, processing & scoring tweet sentiments☆11Aug 24, 2015Updated 10 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Sep 16, 2014Updated 11 years ago
- Hands-On-Predictive-Analytics-with-Python☆15Jan 15, 2021Updated 5 years ago
- Course of Machine Learning in Science and Industry at Heidelberg university☆46Apr 15, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Repository for the PyData DC 2016 tutorial☆29Nov 12, 2016Updated 9 years ago
- ☆26Nov 9, 2019Updated 6 years ago
- 数据挖掘管道搭建示例 基于大航杯“智造扬中”电力AI大赛☆15Jun 13, 2017Updated 8 years ago
- Solutions to all SRMS Division II 250 and 500 point problems☆13Mar 22, 2014Updated 12 years ago
- Slides, code and resources for model interpretation methods in machine learning and deep learning☆33Aug 8, 2019Updated 6 years ago
- Identify the emotion (neutral, anger, contempt, disgust, fear, happy, sadness, surprise) in a given static image☆14Oct 23, 2019Updated 6 years ago
- Fast-Data-Processing-with-Spark-2☆22Jan 18, 2023Updated 3 years ago