Map-reduce, streaming analysis, and external-memory algorithms, and their implementation using Hadoop and its ecosystem: HBase, Hive, Pig, and Spark. The class will include assignments that involve analyzing large existing databases.
☆34 · Apr 3, 2017 · Updated 9 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Users that are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the libraries listed below.
- Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services). ☆235 · Mar 8, 2023 · Updated 3 years ago
- ☆12 · Apr 27, 2018 · Updated 8 years ago
- Python tutorials and puzzles to share with the world! ☆170 · Sep 28, 2017 · Updated 8 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions. ☆16 · Jul 11, 2016 · Updated 9 years ago
- Apache Spark using SQL ☆14 · Aug 18, 2021 · Updated 4 years ago
- Contains solved programs for the HackerRank Problem Solving (Basics) Skill Test Certification 🎓. ☆11 · Nov 1, 2020 · Updated 5 years ago
- At exam time, students often share their notes via social media, and after the exam is over it becomes really difficult t… ☆14 · May 29, 2018 · Updated 8 years ago
- ☆14 · Jan 22, 2019 · Updated 7 years ago
- Road extraction with deep learning from high resolution satellite images. ☆13 · Sep 16, 2021 · Updated 4 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s… ☆29 · Nov 4, 2017 · Updated 8 years ago
- Currency Portfolio Optimization - IPython notebook and data ☆26 · Dec 21, 2015 · Updated 10 years ago
- Code relating to the Coursera Bioinformatics Specialization as well as my own genetic algorithm experiment. ☆11 · Apr 19, 2019 · Updated 7 years ago
- Computer Science, Data Science and ML Fundamentals ☆11 · May 30, 2025 · Updated 11 months ago
- Analyzing Airline data to predict delays ☆19 · May 15, 2014 · Updated 12 years ago
- Group project for the WorldQuant University module, risk management. ☆13 · Feb 3, 2019 · Updated 7 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :). ☆155 · Feb 1, 2018 · Updated 8 years ago
- The fastai deep learning library, plus lessons and tutorials ☆13 · Jun 2, 2019 · Updated 6 years ago
- Python tutorials in both Jupyter Notebook and youtube format. ☆1,256 · Apr 17, 2026 · Updated last month
- ☆20 · Aug 20, 2016 · Updated 9 years ago
- Repo for the Deep Learning Nanodegree Foundations program. ☆12 · Aug 1, 2017 · Updated 8 years ago
- Workshop materials for scraping Twitter with Python ☆13 · May 25, 2016 · Updated 10 years ago
- Android app for spam and fake review detection. ☆13 · Apr 11, 2023 · Updated 3 years ago
- Import Salesforce data into Hadoop HDFS in Avro format ☆23 · Jan 8, 2020 · Updated 6 years ago
- Slides, material and solutions of the popular Statistical Learning course from Stanford's own Hastie & Tibshirani. Join me on my journey … ☆16 · Mar 9, 2018 · Updated 8 years ago
- Automatically predict age and gender in static image files and real-time video streams with reasonably high accuracy. ☆10 · Oct 1, 2020 · Updated 5 years ago
- Detect your face in any image using openCV with dnn module ☆12 · Oct 1, 2020 · Updated 5 years ago
- A repo for talk materials ☆26 · Jun 22, 2020 · Updated 5 years ago
- HackerNews reader ☆10 · Nov 13, 2015 · Updated 10 years ago
- Examples for the FORM+CODE book ☆20 · Oct 2, 2015 · Updated 10 years ago
- Language Modelling, CMI vs Perplexity ☆11 · Mar 17, 2018 · Updated 8 years ago
- Rough working notes on neural networks ☆46 · Dec 12, 2013 · Updated 12 years ago
- A Watson powered conversational bot for small businesses ☆16 · Nov 2, 2017 · Updated 8 years ago
- A collection of basic text processing modules focused on Gujarati ☆10 · Oct 24, 2017 · Updated 8 years ago
- A simple example for PySpark based project. ☆11 · Jun 3, 2016 · Updated 9 years ago
- Data structure and Algorithm with Python ☆20 · Nov 3, 2021 · Updated 4 years ago
- View recent highly-rated albums in the terminal ☆11 · Mar 20, 2023 · Updated 3 years ago
- Personal dotfiles, scripts, libraries, and documentation ☆17 · Apr 13, 2026 · Updated last month
- Data mining algorithms with Python ☆10 · Jun 26, 2019 · Updated 6 years ago
- copy of https://bitbucket.org/dirkbaechle/profile/repositories ☆11 · Feb 15, 2021 · Updated 5 years ago