TugdualSarazin / spark-clustering
Some Spark implementations of clustering algorithms.
☆19Updated 6 years ago
Alternatives and similar repositories for spark-clustering:
Users that are interested in spark-clustering are comparing it to the libraries listed below
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 9 years ago
- functionstest☆33Updated 8 years ago
- Example of running a Genetic Algorithm (Travelling Salesman) on Apache Spark☆44Updated 8 years ago
- ☆33Updated 9 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects.☆48Updated 10 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 4 years ago
- Scripts for parsing / making sense of yarn logs☆52Updated 8 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆28Updated 8 years ago
- Utilities for Apache Spark☆34Updated 9 years ago
- Low level integration of Spark and Kafka☆130Updated 7 years ago
- Project defining the docker image that will support examples of algorithms created in this organization☆13Updated 7 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Source code of Blog at☆52Updated 2 months ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- something to help you spark☆65Updated 6 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- ☆92Updated 7 years ago
- Spark Modularized View☆42Updated 4 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- An umbrella project for multiple implementations of model serving☆45Updated 7 years ago
- ☆21Updated 10 years ago
- Library for organizing batch processing pipelines in Apache Spark☆41Updated 8 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago