ofermend / medicare-demo
A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data
☆47 · Updated 9 years ago
Alternatives and similar repositories for medicare-demo
Users who are interested in medicare-demo are comparing it to the libraries listed below.
- ☆92 · Updated 10 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 · Updated 10 years ago
- Coding exercises for Apache Spark ☆104 · Updated 10 years ago
- Training materials for Strata, AMP Camp, etc ☆148 · Updated 10 years ago
- Scalable Machine Learning in Scalding ☆360 · Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 · Updated 9 years ago
- Tweet Analysis with Spark ☆15 · Updated 8 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool ☆148 · Updated 4 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆66 · Updated 6 years ago
- Distributed Matrix Library ☆72 · Updated 8 years ago
- ☆20 · Updated 9 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io -- ☆60 · Updated 7 years ago
- ☆24 · Updated 10 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles. ☆38 · Updated 7 years ago
- R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark ☆85 · Updated 8 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation ☆45 · Updated 9 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆427 · Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 · Updated 8 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype ☆32 · Updated 10 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial ☆30 · Updated 10 years ago
- A set of methods that predict the future values of popularity indices for news posts using a variety of features. ☆33 · Updated 7 years ago
- Topic Modeling on Apache Spark ☆94 · Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 · Updated 9 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp… ☆61 · Updated 10 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that… ☆32 · Updated 10 years ago
- Course repository for Applied Natural Language Processing ☆125 · Updated 12 years ago
- GPU Acceleration for Apache Spark ☆34 · Updated 10 years ago
- Film recommendations with Apache Spark and Python ☆61 · Updated 10 years ago
- Solution to Facebook's link prediction contest on Kaggle. ☆206 · Updated 13 years ago
- ☆48 · Updated 9 years ago