Sotera / distributed-graph-analytics
Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks such as Giraph and GraphX. The analytics included are High Betweenness Set Extraction, Weakly Connected Components, Page Rank, Leaf Compression, and Louvain Modularity.
☆174Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for distributed-graph-analytics
- Spark / graphX implementation of the distributed louvain modularity algorithm☆312Updated 4 years ago
- A GraphX implementation of Louvain method for community detection. This project also showcases the fact that you don't need to setup a cl…☆37Updated 6 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 4 years ago
- Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.☆360Updated last year
- k Betweenness Centrality algorithm for Spark using GraphX☆55Updated 7 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆184Updated 6 years ago
- spark graphx 的原理及相关操作的源码解析☆211Updated 7 years ago
- ☆80Updated 6 years ago
- Large-scale ML & graph analytics on Giraph☆79Updated 8 years ago
- Approximate Nearest Neighbors in Spark☆174Updated 3 years ago
- GraphChi's Java version☆238Updated 11 months ago
- Locality Sensitive Hashing for Apache Spark☆88Updated 2 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆105Updated 7 years ago
- DBSCAN clustering algorithm on top of Apache Spark☆255Updated 6 years ago
- Item and User-based KNN recommendation algorithms using PySpark☆126Updated 7 years ago
- Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark☆88Updated 5 years ago
- Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs☆313Updated last week
- Distributed Temporal Graph Analytics with Apache Flink☆245Updated this week
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆267Updated 5 months ago
- A SimRank algorithm implementation using Spark☆49Updated 10 years ago
- An experimental Graph Streaming API for Apache Flink☆140Updated 4 years ago
- An implement of Factorization Machines (LibFM)☆248Updated 6 years ago
- Java library and command-line application for converting XGBoost models to PMML☆128Updated 2 months ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆104Updated 8 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- Glint: High performance scala parameter server☆168Updated 6 years ago
- Spark-based GBM☆56Updated 4 years ago