infochimps-labs / data_science_fun_pack
Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.
☆29Updated 10 years ago
Alternatives and similar repositories for data_science_fun_pack:
Users that are interested in data_science_fun_pack are comparing it to the libraries listed below
- Generating the next read for our book club- with Data Science!☆40Updated 9 years ago
- Data-Intensive Text Processing with MapReduce☆625Updated 4 years ago
- PythonForDataScience☆155Updated 8 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- Muppet☆126Updated 3 years ago
- Data and example code for Programming Pig, by Alan F. Gates☆187Updated 8 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- ☆9Updated 9 years ago
- Presentation at Perth Data Science Meetup, February 2015☆72Updated 10 years ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 9 years ago
- Hadoop Map-Reduce Design Patterns☆72Updated 2 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Updated 2 years ago
- A set of Hadoop utilities to make working with Hadoop a little easier.☆26Updated 5 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 9 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆286Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.☆238Updated 8 years ago
- Real-Time Analytics with Storm☆79Updated 2 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- HDP Data Science/Machine Learning demo☆37Updated 9 years ago
- ☆48Updated 8 years ago
- Hadoop Cluster Configurations☆32Updated 3 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- A free electronic book about Apache Hive. The book is geared towards SQL-knowledgeable business users with some advanced tips for devops.…☆103Updated 7 years ago
- Hadoop training material from free MapR courses.☆53Updated 8 years ago
- A platform for real-time streaming search☆103Updated 9 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 5 years ago
- Introduction to Statistics☆231Updated 9 years ago
- A companion wiki + code repository for the O'Reilly Media video "Just Enough Math". This site provides additional links, sample code, and…☆155Updated 2 years ago
- A package that allows R developers to use Hadoop HBase☆48Updated 10 years ago