Sparse feature extraction with Spark
☆30Jul 25, 2018Updated 7 years ago
Alternatives and similar repositories for modelmatrix
Users that are interested in modelmatrix are comparing it to the libraries listed below
Sorting:
- Haskell implementation of HyperLogLog++ & MinHash for efficient cardinality and intersection estimation☆12Aug 1, 2016Updated 9 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Feb 1, 2018Updated 8 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Jan 21, 2016Updated 10 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- ☆10May 3, 2015Updated 10 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆104Jul 5, 2016Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Dec 28, 2016Updated 9 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Jun 18, 2016Updated 9 years ago
- Visualize streaming machine learning in Spark☆177Jun 29, 2017Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- a thin scala wrapper for jedis (https://github.com/xetorthio/jedis)☆61Jan 10, 2017Updated 9 years ago
- Analyzing Twitter real time feed with Spark Streaming☆32Feb 27, 2015Updated 11 years ago
- Raspberry Pi Turta röle kartını görsel arayüz üzerinden kontrol eden python dili ile yazılmış program☆11Nov 30, 2016Updated 9 years ago
- A collection of Apache Parquet add-on modules☆30Feb 12, 2026Updated 2 weeks ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 3 years ago
- Experimental pure Java revised simplex linear program solver (Apache 2.0 license)☆15Jun 22, 2020Updated 5 years ago
- ☆12Oct 25, 2015Updated 10 years ago
- A primal-dual framework for distributed L1-regularized optimization☆37Apr 18, 2016Updated 9 years ago
- Utilities for building distributed systems on top of mesos☆23Aug 25, 2018Updated 7 years ago
- BigDataBench Spark workloads☆11Jul 15, 2016Updated 9 years ago
- Parallel programs with OpenMPI☆10Apr 1, 2015Updated 10 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- Scala collection views meet Transducers hype☆41Oct 13, 2015Updated 10 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 3 months ago
- A tutorial on Apache Spark Unit Testing☆37Jan 27, 2016Updated 10 years ago
- Coursera Machine Learning class examples in Spark☆43Feb 14, 2014Updated 12 years ago
- Demo for Sitepoint Docker Compose article☆11Oct 1, 2015Updated 10 years ago
- Part-of-speech tagger implemented using a feedforward network in TensorFlow☆14Jan 15, 2018Updated 8 years ago
- Implementation of the K-Means clustering algorithm using Hadoop.☆10Apr 30, 2012Updated 13 years ago
- 编译语言实现模式例程☆11Nov 22, 2014Updated 11 years ago
- 抓取国家统计局数据☆13May 4, 2016Updated 9 years ago
- Advertising delivery engine based on SF1R☆16Sep 23, 2014Updated 11 years ago
- ☆11Apr 10, 2014Updated 11 years ago
- A Scala Swing component that wraps javax.swing.JTree☆15Feb 4, 2013Updated 13 years ago
- Footstep planning and Trajectory Optimization☆10Apr 12, 2015Updated 10 years ago