Map-reduce, streaming analysis, and external-memory algorithms, and their implementation using Hadoop and its ecosystem: HBase, Hive, Pig, and Spark. The class will include assignments on analyzing large existing databases.
☆34Apr 3, 2017Updated 9 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Users that are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the libraries listed below.
- Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practic…☆54Mar 26, 2018Updated 8 years ago
- Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)☆181Aug 21, 2017Updated 8 years ago
- ☆10May 4, 2019Updated 7 years ago
- This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Commun…☆21Oct 19, 2025Updated 8 months ago
- Distributed supply chain application (DApp) that uses a Solidity smart contract to track pharmaceuticals on the Ethereum blockchain.☆10Aug 1, 2019Updated 6 years ago
- Interactive computing for complex data processing, modeling and analysis in Python 3☆79May 3, 2024Updated 2 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆16Jul 11, 2016Updated 9 years ago
- ☆18Aug 15, 2022Updated 3 years ago
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Repo for Coursera.com online course: Statistical Inference☆10Aug 1, 2014Updated 11 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- At exam time, students often share their notes via social media, and after the exam is over it becomes really difficult t…☆14May 29, 2018Updated 8 years ago
- Social Media Analysis, scalable solution, flexible deployment that analyses social media contents☆10Jul 20, 2023Updated 2 years ago
- Resilient Automation Functions and Scripts☆15Jan 5, 2022Updated 4 years ago
- Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d…☆12Oct 5, 2024Updated last year
- Coursera Quiz Solutions☆11Aug 11, 2022Updated 3 years ago
- Houses price prediction web app☆11Feb 20, 2026Updated 3 months ago
- Currency Portfolio Optimization - IPython notebook and data☆26Dec 21, 2015Updated 10 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy)☆13Aug 24, 2022Updated 3 years ago
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated 2 months ago
- Computer Science, Data Science and ML Fundamentals☆11May 30, 2025Updated last year
- Analyzing Airline data to predict delays☆19May 15, 2014Updated 12 years ago
- Group project for the WorldQuant University module, risk management.☆13Feb 3, 2019Updated 7 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆155Feb 1, 2018Updated 8 years ago
- Materials and code relating to Learning Intelligence 25.☆11Mar 23, 2018Updated 8 years ago
- ☆17May 16, 2020Updated 6 years ago
- Solutions to Database-SQL course by Stanford University☆17Mar 1, 2019Updated 7 years ago
- Repo for the Deep Learning Nanodegree Foundations program.☆12Aug 1, 2017Updated 8 years ago
- Workshop materials for scraping Twitter with Python☆13May 25, 2016Updated 10 years ago
- AngularJS library for working with Google Maps☆20Mar 15, 2016Updated 10 years ago
- ☆13Nov 2, 2020Updated 5 years ago
- Road Extraction from Satellite Images with an ensemble of 3 U-Nets☆12Dec 20, 2018Updated 7 years ago
- Slides, material and solutions of the popular Statistical Learning course from Stanford's own Hastie & Tibshirani. Join me on my journey …☆16Mar 9, 2018Updated 8 years ago
- All of my projects from the Udacity Deep Learning Foundations Nanodegree.☆12Aug 17, 2017Updated 8 years ago
- ☆15Feb 20, 2026Updated 3 months ago
- Sends public ip through e-mail. Command-line standalone.☆16Oct 16, 2016Updated 9 years ago
- Language Modelling, CMI vs Perplexity☆11Mar 17, 2018Updated 8 years ago
- A collection of basic text processing modules focused on Gujarati☆10Oct 24, 2017Updated 8 years ago
- A simple example for PySpark based project.☆11Jun 3, 2016Updated 10 years ago