MicrosoftResearch / Dryad
This is a research prototype of the Dryad and DryadLINQ data-parallel processing frameworks running on Hadoop YARN.
☆324 Updated 10 years ago
Alternatives and similar repositories for Dryad:
Users who are interested in Dryad are comparing it to the repositories listed below.
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads ☆517 Updated 3 years ago
- Mirror of Apache Hama ☆131 Updated 5 years ago
- Enabling queries on compressed data. ☆278 Updated last year
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 Updated 11 years ago
- Sample implementations using Naiad ☆38 Updated 10 years ago
- GraphView is a DLL library that enables users to use SQL Server or Azure SQL Database to efficiently manage graphs. ☆529 Updated last year
- Transactional Support for HBase (Mirror of https://github.com/apache/incubator-omid) ☆300 Updated 7 years ago
- Mirror of Apache Giraph ☆618 Updated last year
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop ☆243 Updated 9 years ago
- Sparrow scheduling platform (U.C. Berkeley). ☆319 Updated 4 years ago
- FishStore is a prototype fast ingestion and querying layer for flexible-schema data ☆222 Updated last year
- Robust Distributed System Nucleus (rDSN) is an open framework for quickly building and managing high performance and robust distributed s… ☆965 Updated 10 months ago
- GraphChi's Java version ☆237 Updated last year
- A streaming / online query processing / analytics engine based on Apache Storm ☆271 Updated 7 years ago
- Low latency, strong consistency, fault tolerant distributed key value store. Colocate data and compute to achieve best performance cloud … ☆114 Updated 9 years ago
- A Distributed Matrix Operations Library Built on Top of Spark ☆106 Updated 8 years ago
- This code base is retained for historical interest only; please visit the Apache Incubator repo for the latest one ☆560 Updated 2 years ago
- Real²time Exploratory Analytics on Large Datasets ☆122 Updated 5 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ... ☆637 Updated last year
- moved to https://github.com/dmlc/ps-lite ☆648 Updated 9 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. ☆426 Updated 8 years ago
- Replicated State Library. RSL is the Azure Paxos implementation which is used by multiple products in Azure and Bing. It provides the tra… ☆76 Updated last year
- DistML provides a supplement to mllib to support model-parallel on Spark ☆166 Updated 8 years ago
- Apache Quickstep Incubator - This project is retired ☆95 Updated 6 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆470 Updated 7 years ago
- The Naiad system provides fast incremental and iterative computation for data-parallel workloads ☆24 Updated 7 years ago
- This is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on github. The regressio… ☆311 Updated 4 years ago
- MacroBase: A Search Engine for Fast Data ☆665 Updated 2 years ago
- Mirror of Apache Apex core ☆349 Updated 3 years ago
- SDK for Turi's GraphLab Create. ☆149 Updated 7 years ago