koooee / BigDataR_Examples
Data Science and Machine Learning Examples for Data Science Linux
☆31 Updated 13 years ago
Alternatives and similar repositories for BigDataR_Examples
Users who are interested in BigDataR_Examples are comparing it to the repositories listed below
- Training material ☆47 Updated last year
- Deep learning made easy ☆116 Updated 11 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 Updated 9 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016 ☆27 Updated 9 years ago
- Fast Ensembles of Sparse Trees ☆38 Updated 9 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in healthcare data ☆47 Updated 10 years ago
- A simple example application that will connect to the Twitter API, run a search, gather tweets, and then calculate the sentiment of each … ☆65 Updated 10 years ago
- An R-like GLM package for Apache Spark ☆10 Updated 10 years ago
- Defunct ☆241 Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆19 Updated 8 years ago
- Repository for data science course Spring 14 ☆185 Updated 11 years ago
- How-to code samples for working with GraphLab Create ☆207 Updated 9 years ago
- Public presentations ☆24 Updated 9 months ago
- A package that allows R developers to use Hadoop MapReduce ☆158 Updated 5 years ago
- Code to create benchmarks for Kaggle's Facebook Recruiting Competition ☆86 Updated 13 years ago
- Oracle Data Science Bootcamp 2014 ☆25 Updated 10 years ago
- Repository for SF QConf 2015 Workshop ☆16 Updated last year
- Mirror of Apache Zeppelin (Incubating) ☆44 Updated 9 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory ☆91 Updated 10 years ago
- Distributed Matrix Library ☆72 Updated 9 years ago
- The 2nd place solution for the West Nile Virus Prediction challenge on Kaggle ☆36 Updated 10 years ago
- Diachronic text analysis in Python ☆27 Updated 5 years ago
- RHive is an R extension facilitating distributed computing via Apache Hive ☆123 Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be… ☆32 Updated 10 years ago
- Common Code Workflow tutorial on Theano ☆28 Updated 12 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content) ☆28 Updated 11 years ago
- Script to perform dictionary-based n-gram text tagging efficiently in Apache Spark ☆11 Updated 9 years ago
- Real-time dashboard for Twitter sentiment analysis using Spark Streaming and Watson Tone Analyzer ☆31 Updated 6 years ago
- Source material for using Python and Hadoop together ☆13 Updated 8 years ago
- Install directions and example notebooks for Udacity's Deep Learning classes ☆28 Updated 10 years ago