oreillymedia / hadoop_the_definitive_guide_4e
This is the Case Study repository for Hadoop: The Definitive Guide, 4E by Tom White (O'Reilly Media)
☆26Updated 9 years ago
Alternatives and similar repositories for hadoop_the_definitive_guide_4e:
Users that are interested in hadoop_the_definitive_guide_4e are comparing it to the libraries listed below
- Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.☆237Updated 8 years ago
- Example code for the Wiley book "Machine Learning - Hands On for Developers and Technical Professionals"☆68Updated 10 years ago
- Repository for data science course Spring 14☆184Updated 10 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆166Updated 9 years ago
- Source code that accompanies the book "Hadoop in Practice, Second Edition".☆80Updated 10 years ago
- The book's repo☆272Updated 7 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Simple example on how to use recommenders in Spark / MLlib☆70Updated 4 years ago
- An implementation of a real-world map-reduce workflow in each major framework.☆151Updated 8 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆47Updated 10 years ago
- Simple Spark Application☆76Updated last year
- Course homepages for courses that I've taught at the University of Maryland☆55Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Source code for 'Practical Hive' by Scott Shaw, Andreas François Vermeulen, Ankur Gupta, and David Kjerrumgaard☆34Updated 7 years ago
- ☆44Updated 7 years ago
- Examples for learning spark☆332Updated 9 years ago
- Online machine learning algorithms based on Spark streaming☆12Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning.☆203Updated 5 years ago
- Source code for 'Java Threads and the Concurrency Utilities' by JEFF FRIESEN☆35Updated 7 years ago
- Data directory for the CS109 Data Science course☆66Updated 10 years ago
- Files for my scikit-learn tutorial at PyCon 2013☆170Updated 8 years ago
- Taming Text Book Source Code☆379Updated last year
- Repository for MapReduce Design Patterns (O'Reilly 2012) example source code☆235Updated 9 years ago
- kmeans☆62Updated 7 months ago
- ☆26Updated 9 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆350Updated 3 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Updated 12 years ago