Additional useful algorithms that can be used with Spark.
☆24 · Dec 24, 2014 · Updated 11 years ago
Alternatives and similar repositories for SparkAlgorithms
Users who are interested in SparkAlgorithms are comparing it to the libraries listed below.
- ☆12 · Apr 8, 2016 · Updated 9 years ago
- An analysis of the Aadhaar dataset using MapReduce and Spark ☆14 · Feb 28, 2018 · Updated 8 years ago
- Coding exercises for Apache Spark ☆104 · Jun 4, 2015 · Updated 10 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark ☆15 · Oct 6, 2017 · Updated 8 years ago
- ☆56 · Aug 21, 2014 · Updated 11 years ago
- Scala client for the Lightning data visualization server (WIP) ☆47 · Jun 25, 2019 · Updated 6 years ago
- A primal-dual framework for distributed L1-regularized optimization ☆37 · Apr 18, 2016 · Updated 9 years ago
- The code for the in-memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 · Jun 1, 2015 · Updated 10 years ago
- Data science repo to help others ☆12 · Feb 10, 2016 · Updated 10 years ago
- Some popular algorithms (DBSCAN, kNN, FM, etc.) on Spark ☆32 · May 29, 2018 · Updated 7 years ago
- Repo with sources for Spark blog posts and learning experiments in Spark ☆14 · Oct 16, 2015 · Updated 10 years ago
- Prescriptive Applications over Kite and Hadoop ☆12 · Oct 14, 2015 · Updated 10 years ago
- ☆12 · Sep 26, 2019 · Updated 6 years ago
- A comprehensive travel time dataset for the entire landmass of the world to the nearest town. ☆15 · Oct 31, 2018 · Updated 7 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark. ☆17 · Apr 26, 2017 · Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 · Jul 3, 2023 · Updated 2 years ago
- Scala bindings for JTS ☆10 · Jan 3, 2014 · Updated 12 years ago
- Memory consumption estimator for Scala/Java ☆26 · Nov 24, 2014 · Updated 11 years ago
- Command line tool that transpiles Scala code into Java code. ☆12 · Sep 26, 2015 · Updated 10 years ago
- Omnivore Optimizer and Distributed CcT ☆13 · Jun 17, 2016 · Updated 9 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark. ☆104 · Jul 2, 2018 · Updated 7 years ago
- Query Spark in real time and visualise the results as a graph. ☆24 · Oct 4, 2017 · Updated 8 years ago
- Experiments about the use of neural networks to discover outliers in high-dimensional data ☆10 · May 17, 2017 · Updated 8 years ago
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 · Oct 27, 2015 · Updated 10 years ago
- Scalable recommendation system written in Scala using the Apache Spark framework ☆105 · Jan 30, 2015 · Updated 11 years ago
- ☆15 · Jan 25, 2018 · Updated 8 years ago
- The released version of Astro(Spark SQL on HBase) has been moved to: ☆16 · Jul 23, 2015 · Updated 10 years ago
- A WIP Udemy downloader written in Go ☆11 · Mar 20, 2022 · Updated 4 years ago
- Additional useful algorithms that can be used with Spark. ☆13 · Feb 2, 2015 · Updated 11 years ago
- Simplify getting Zeppelin up and running ☆56 · Jul 20, 2016 · Updated 9 years ago
- Some Spark implementations of clustering algorithms. ☆19 · Nov 13, 2018 · Updated 7 years ago
- Some IPython notebooks I've created... ☆29 · Mar 17, 2016 · Updated 10 years ago
- Using data to dig into the 2015 NL Cy Young race ☆10 · Nov 19, 2015 · Updated 10 years ago
- Locality Sensitive Hashing for Apache Spark ☆197 · Nov 1, 2016 · Updated 9 years ago
- ☆11 · Sep 16, 2016 · Updated 9 years ago
- Import Salesforce data into Hadoop HDFS in Avro format ☆23 · Jan 8, 2020 · Updated 6 years ago
- Big Spatial Data Processing using Spark ☆146 · Mar 7, 2017 · Updated 9 years ago
- Factorization Machines on Spark and Glint ☆25 · Nov 7, 2016 · Updated 9 years ago
- Benchmarks of BLAS libraries with Scala interface ☆30 · Jan 21, 2016 · Updated 10 years ago