ofermend / medicare-demo
A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data
☆47 Updated 9 years ago
Alternatives and similar repositories for medicare-demo:
Users who are interested in medicare-demo are comparing it to the libraries listed below:
- Spark library for doing exploratory data analysis in a scalable way ☆43 Updated 9 years ago
- ☆92 Updated 9 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles. ☆38 Updated 6 years ago
- Machine Learning for Cascading ☆81 Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- Training materials for Strata, AMP Camp, etc ☆149 Updated 9 years ago
- Large-scale ML & graph analytics on Giraph ☆79 Updated 9 years ago
- ☆20 Updated 8 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 Updated 8 years ago
- Distributed Matrix Library ☆71 Updated 8 years ago
- Coding exercises for Apache Spark ☆104 Updated 9 years ago
- GPU Acceleration for Apache Spark ☆34 Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆37 Updated 10 years ago
- ☆24 Updated 10 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆46 Updated 5 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 11 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool ☆148 Updated 3 years ago
- Java implementation of Microsoft's AdPredictor algorithm ☆17 Updated 7 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io ☆60 Updated 7 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 Updated 12 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 Updated 9 years ago
- System for mining Wikipedia Usage data to read our collective mind ☆21 Updated 10 years ago
- Public code files for the DDL blog ☆56 Updated 6 years ago
- Predicting sales with Pandas ☆15 Updated 9 years ago
- Data Science in Scala - Conf. Talk Repo ☆15 Updated 9 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Recommendations Serving Engine using python ☆28 Updated 9 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype ☆32 Updated 10 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation ☆45 Updated 8 years ago