ofermend / medicare-demo
A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data
☆47Updated 9 years ago
Alternatives and similar repositories for medicare-demo:
Users that are interested in medicare-demo are comparing it to the libraries listed below
- the 2nd place solution for West Nile Virus Prediction challenge on Kaggle☆36Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆24Updated 9 years ago
- Exploration Library in Java☆12Updated last year
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- ☆92Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆157Updated 11 years ago
- training material☆47Updated 4 months ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆36Updated 9 years ago
- Public code files for the DDL blog☆56Updated 6 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- Distributed Matrix Library☆71Updated 8 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- Film recommendations with Apache Spark and Python☆61Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Templates for projects based on top of H2O.☆37Updated this week
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 7 years ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 9 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 8 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- Large-scale ML & graph analytics on Giraph☆79Updated 9 years ago
- k-means + a linear model = good results☆55Updated 10 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 10 years ago
- A chef cookbook for deploying spark☆30Updated 11 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Topic Modeling on Apache Spark☆95Updated 6 years ago