soundcloud / spdtLinks
Streaming Parallel Decision Tree
☆54Updated this week
Alternatives and similar repositories for spdt
Users that are interested in spdt are comparing it to the libraries listed below
Sorting:
- Reduce your data. A unix filter for algebird-powered aggregation.☆140Updated 8 years ago
 - Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
 - Fluent Scala DSL for Google's Cloud Dataflow SDK☆56Updated 10 years ago
 - A scala-based feature generation and modeling framework☆61Updated 7 years ago
 - Bloofi: A java implementation of multidimensional Bloom filters☆83Updated 4 months ago
 - something to help you spark☆64Updated 7 years ago
 - Scala binding for ZeroMQ☆71Updated 12 years ago
 - Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
 - Beautiful trees, without the landscaping.☆140Updated 3 years ago
 - Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
 - ☆46Updated 8 years ago
 - Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago
 - Secondary sort and streaming reduce for Apache Spark☆78Updated 2 years ago
 - Embeddable multi-Paxos For The JVM☆77Updated 10 months ago
 - Scala client for the Lightning data visualization server (WIP)☆47Updated 6 years ago
 - Aerospike Spark Connector☆35Updated 8 years ago
 - This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
 - Probabilistic data structures for Guava.☆54Updated 5 years ago
 - ☆39Updated 9 years ago
 - Sparse feature extraction with Spark☆30Updated 7 years ago
 - A Scala DSL for the Kompics framework☆19Updated 3 years ago
 - Scalable Machine Learning in Scalding☆360Updated 7 years ago
 - Cantor provides utilities for estimating the cardinality of large sets.☆84Updated 3 years ago
 - Kompics - A message-passing component model for building distributed systems☆66Updated 3 years ago
 - A CPU and GPU-accelerated matrix library for data mining☆265Updated 4 years ago
 - Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 10 years ago
 - Storehaus is a library that makes it easy to work with asynchronous key value stores☆465Updated 5 years ago
 - सूचि - Toolkit to build Distributed Data Systems☆53Updated 2 years ago
 - Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
 - MLeap allows for easily putting Spark ML pipelines into production☆78Updated 9 years ago