shiv4nsh / spark-LDA-example
A simple Spark LDA example. to demonstrate a full fletched clustering algorithm, with data cleaning using the processess like lemmatization , stemming etc.
☆23Updated 8 years ago
Alternatives and similar repositories for spark-LDA-example:
Users that are interested in spark-LDA-example are comparing it to the libraries listed below
- Spark 2.0 Scala Machine Learning examples☆77Updated 5 years ago
- Twitter sentiment analysis based on Apache Spark, MLlib, Scala and Akka.☆53Updated 8 years ago
- Elastic Search on Spark☆112Updated 10 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)☆94Updated 2 years ago
- ☆92Updated 7 years ago
- Building Annoy Index on Apache Spark☆72Updated 4 years ago
- Topic Modeling with LDA in Scala and Spark☆31Updated 6 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Updated 9 years ago
- Getting started with Spark, Spark Streaming, Spark SQL, DataFrame☆36Updated 8 years ago
- Apache Spark applications☆70Updated 7 years ago
- ElasticSearch integration for Apache Spark☆47Updated 8 years ago
- Spark Streaming HBase Example☆95Updated 8 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- An implementation of Markov Clustering algorithm for Spark in Scala☆34Updated 7 years ago
- ScalaIO 2014 Workshop☆25Updated 10 years ago
- Low level integration of Spark and Kafka☆130Updated 6 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 8 years ago
- spark + drools☆102Updated 2 years ago
- The Nak Machine Learning Library☆341Updated 7 years ago
- Examples for using Breeze.☆60Updated 11 years ago
- Chalk is a natural language processing library.☆258Updated 8 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- ☆33Updated 9 years ago
- Getting started with Spark, Spark streaming, Spark SQL and DataFrame.☆48Updated 6 years ago