grafos-ml / okapi
Large-scale ML & graph analytics on Giraph
☆79Updated 9 years ago
Alternatives and similar repositories for okapi:
Users that are interested in okapi are comparing it to the libraries listed below
- GraphChi's Java version☆238Updated last year
- Scalable Graph Mining☆61Updated 2 years ago
- Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.☆360Updated 2 years ago
- Testing framework for Collaborative Filtering☆38Updated 9 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- ADMM based large scale logistic regression☆337Updated last year
- ☆110Updated 7 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- *Experimental* GraphChi-DB graph database with computational capabilities☆79Updated 9 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆128Updated 9 years ago
- Locality Sensitive Hashing for Apache Spark☆88Updated 2 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆106Updated 8 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆426Updated 8 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- Factorization Machines on Spark and Glint☆25Updated 8 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 4 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- An experimental Graph Streaming API for Apache Flink☆141Updated 4 years ago
- Reactive Factorization Engine☆104Updated 9 years ago
- Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks su…☆174Updated 6 years ago
- Locality Sensitive Hashing for Apache Spark☆195Updated 8 years ago
- Splash Project for parallel stochastic learning☆94Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- Machine Learning Tool Kit☆136Updated 4 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆166Updated 7 years ago
- My blogs☆46Updated 8 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago