tobiajo / yarntf
Easy distributed TensorFlow on Hadoop (moved to: hops-tensorflow)
☆9 Updated 7 years ago
Alternatives and similar repositories for yarntf:
Users who are interested in yarntf are comparing it to the libraries listed below
- HopsYARN Tensorflow Framework. ☆32 Updated 5 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible. ☆51 Updated 7 years ago
- Spark MLlib code optimized to efficiently support sparse data ☆50 Updated 8 years ago
- This is an introduction of Apache Spark DataFrames. ☆41 Updated 9 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆62 Updated 8 years ago
- A collection of efficient utilities for a data scientist. ☆41 Updated 9 years ago
- Tail a log file and send log lines automatically to a kafka topic ☆57 Updated 12 years ago
- What happens on the wire when Hadoop RPC call is issued? ☆14 Updated 2 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆15 Updated 4 years ago
- IoT - It's the thing you want! And so here's a full-stack demo. ☆62 Updated 8 years ago
- dllib is a distributed deep learning library running on Apache Spark ☆32 Updated 7 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆54 Updated 4 years ago
- Exelixi is a distributed framework for running genetic algorithms at scale. The framework is based on Apache Mesos and the code is mostly… ☆34 Updated 10 years ago
- Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka ☆12 Updated last year
- ☆10 Updated 8 years ago
- demo clients ☆20 Updated 7 years ago
- A spark package for loading Spark ML models to Redis-ML ☆63 Updated 5 years ago
- Cascading on Apache Flink® ☆54 Updated 11 months ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆113 Updated 3 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 Updated 10 years ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access… ☆21 Updated last year
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark ☆31 Updated 6 years ago
- Muppet ☆126 Updated 3 years ago
- Secondary index on HBase ☆18 Updated 9 years ago
- scalding powered machine learning ☆109 Updated 10 years ago
- one large file contains a billion of small files ☆14 Updated 10 years ago
- Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It … ☆133 Updated 11 years ago
- Utilities for building distributed systems on top of mesos ☆24 Updated 6 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr… ☆23 Updated 3 years ago
- Sparse feature extraction with Spark ☆30 Updated 6 years ago