mGalarnyk / DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Map-reduce, streaming analysis, and external-memory algorithms, and their implementation using Hadoop and its ecosystem: HBase, Hive, Pig, and Spark. The class includes assignments on analyzing large existing databases.
☆34 · Apr 3, 2017 · Updated 8 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Users who are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the repositories listed below
- Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practic… ☆54 · Mar 26, 2018 · Updated 7 years ago
- Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210) ☆182 · Aug 21, 2017 · Updated 8 years ago
- Database Management Systems Data Science Masters Course (DSE 201) ☆12 · Jun 26, 2016 · Updated 9 years ago
- ☆10 · May 4, 2019 · Updated 6 years ago
- This is a general purpose wrapper for converting Datalog queries to Neo4J graph database ☆10 · Dec 9, 2016 · Updated 9 years ago
- Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services). ☆236 · Mar 8, 2023 · Updated 2 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions. ☆16 · Jul 11, 2016 · Updated 9 years ago
- Coursera machine learning specialization coursework (python based, University of Washington). ☆18 · Mar 28, 2016 · Updated 9 years ago
- Currency Portfolio Optimization - IPython notebook and data ☆26 · Dec 21, 2015 · Updated 10 years ago
- ☆18 · Aug 15, 2022 · Updated 3 years ago
- This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Commun… ☆17 · Oct 19, 2025 · Updated 3 months ago
- Software to calculate atomic scattering factors and properties for Quantum Crystallography ☆13 · Updated this week
- Pseudopotential converter from upf to psp8 ☆11 · Jan 25, 2023 · Updated 3 years ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks ☆11 · Jul 19, 2019 · Updated 6 years ago
- Apache Spark Guide ☆35 · Feb 1, 2022 · Updated 4 years ago
- Repo for Coursera.com online course: Statistical Inference ☆10 · Aug 1, 2014 · Updated 11 years ago
- MotoGP/Linear Regression/Web Scraping ☆10 · Mar 12, 2018 · Updated 7 years ago
- A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extra… ☆12 · Oct 5, 2025 · Updated 4 months ago
- ☆10 · Jun 30, 2022 · Updated 3 years ago
- Generator of polynomial machine learning potentials ☆19 · Updated this week
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- A simple example for PySpark based project. ☆11 · Jun 3, 2016 · Updated 9 years ago
- Master's project - Artificial Immune System for symbolic regression. ☆14 · May 2, 2013 · Updated 12 years ago
- A Watson powered conversational bot for small businesses ☆16 · Nov 2, 2017 · Updated 8 years ago
- ☆12 · Apr 27, 2018 · Updated 7 years ago
- RPMD and rate constant calculations on black-box potential energy surfaces ☆15 · Updated this week
- Fast implementation of Gradient Boosting Machine (GBM) training algorithm. ☆10 · Aug 26, 2019 · Updated 6 years ago
- Workshop materials for scraping Twitter with Python ☆13 · May 25, 2016 · Updated 9 years ago
- Python library for computing electron-phonon renormalizations from finite displacements ☆11 · Jan 6, 2025 · Updated last year
- Deep Neural Networks for Python ☆10 · Sep 22, 2015 · Updated 10 years ago
- Computer Science, Data Science and ML Fundamentals ☆11 · May 30, 2025 · Updated 8 months ago
- Source code for 'Up and Running with DAX for Power BI' by Alison Box ☆12 · Jun 10, 2022 · Updated 3 years ago
- Full-potential Linearized Augmented Plane Wave code FLEUR: All-electron DFT (repo mirror) ☆16 · Updated this week
- Prepare topology and coordinate file for CG models in Genesis. ☆13 · Jul 3, 2025 · Updated 7 months ago
- python wrapper for fdmnes data input/output ☆14 · Mar 11, 2021 · Updated 4 years ago
- Machine Learning based model to predict Insurance Pure Premium ☆12 · Jan 24, 2017 · Updated 9 years ago
- gammcor code ☆11 · Sep 25, 2025 · Updated 4 months ago
- RFM (recency, frequency, monetary) analysis ☆13 · Aug 11, 2018 · Updated 7 years ago
- Developed a recommendation system in Python using Netflix prize dataset and MovieLens data set using collaborative filtering technique to… ☆11 · Aug 16, 2018 · Updated 7 years ago