AtlasPilotPuppy/SparkAlgorithms

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AtlasPilotPuppy/SparkAlgorithms)

AtlasPilotPuppy / SparkAlgorithms

Additional useful algorithms that can be used with spark.

☆24

Alternatives and similar repositories for SparkAlgorithms

Users that are interested in SparkAlgorithms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bellettif / sparkGeoTS
View on GitHub
☆12Apr 8, 2016Updated 10 years ago
stdatalabs / aadhaar-dataset-analysis
View on GitHub
An analysis on Aadhaar dataset using Mapreduce and Spark
☆14Feb 28, 2018Updated 8 years ago
ceteri / spark-exercises
View on GitHub
Coding exercises for Apache Spark
☆103Jun 4, 2015Updated 11 years ago
anilmuppalla / hpdc-scalding-spark
View on GitHub
Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark
☆15Oct 6, 2017Updated 8 years ago
gwenshap / SparkStreamingExample
View on GitHub
☆55Aug 21, 2014Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gingsmith / proxcocoa
View on GitHub
A primal-dual framework for distributed L1-regularized optimization
☆37Apr 18, 2016Updated 10 years ago
lightning-viz / lightning-scala
View on GitHub
Scala client for the Lightning data visualization server (WIP)
☆47Jun 25, 2019Updated 7 years ago
alteryx / sparkGLM
View on GitHub
An R-like GLM package for Apache Spark
☆10Aug 6, 2015Updated 10 years ago
bentaylordata / datascience
View on GitHub
Data science repo to help others
☆12Feb 10, 2016Updated 10 years ago
rbrush / kite-apps
View on GitHub
Prescriptive Applications over Kite and Hadoop
☆12Oct 14, 2015Updated 10 years ago
alexanderfefelov / eai-patterns-with-actor-model
View on GitHub
EAI Patterns with Actor Model by Vaughn Vernon
☆14Jan 18, 2014Updated 12 years ago
BD2KGenomics / conductor
View on GitHub
Efficient, distributed downloads of large files from S3 to HDFS using Spark.
☆17Apr 26, 2017Updated 9 years ago
big-data-research / in-memory-data-pipeline
View on GitHub
The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.
☆10Jun 1, 2015Updated 11 years ago
tresata / spark-sorted
View on GitHub
Secondary sort and streaming reduce for Apache Spark
☆77Jul 3, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
phatak-dev / java-sizeof
View on GitHub
Memory consumption estimator for Scala/Java
☆27Nov 24, 2014Updated 11 years ago
Aivean / scala-to-java
View on GitHub
Command line tool that transpiles scala code into java code.
☆12Sep 26, 2015Updated 10 years ago
spoddutur / graph-knowledge-browser
View on GitHub
Real-time query spark and visualise it as graph.
☆24Oct 4, 2017Updated 8 years ago
edouardfouche / neural-based-outlier-discovery
View on GitHub
Experiments about the use of neural networks to discover outliers in high-dimensional data
☆10May 17, 2017Updated 9 years ago
FurongHuang / spectrallda-tensorspark
View on GitHub
Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.
☆104Jul 2, 2018Updated 8 years ago
mkrcah / scala-kafka-twitter
View on GitHub
Example integration of Kafka, Avro & Spark-Streaming on live Twitter feed
☆22Jan 23, 2015Updated 11 years ago
PrincetonUniversity / fcma-toolbox
View on GitHub
Princeton Full Correlation Matrix Analysis (FCMA) Toolbox
☆18Dec 8, 2016Updated 9 years ago
bythebay / pipeline
View on GitHub
Complete Pipeline Training at Big Data Scala By the Bay
☆71Oct 27, 2015Updated 10 years ago
Huawei-Spark / Backup-Repo
View on GitHub
The released version of Astro(Spark SQL on HBase) has been moved to:
☆16Jul 23, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jdgoldie / cassandra-docker-compose
View on GitHub
Dockerfile and docker-compose file for running a simple Cassandra cluster
☆14Apr 21, 2015Updated 11 years ago
OndraFiedler / spark-recommender
View on GitHub
Scalable recommendation system written in Scala using the Apache Spark framework
☆105Jan 30, 2015Updated 11 years ago
codeAshu / SparkAlgorithms
View on GitHub
Additional useful algorithms that can be used with spark.
☆12Feb 2, 2015Updated 11 years ago
dleung / spark-streaming-kafka-example
View on GitHub
An example project using Spark Streaming with Kafka message and Avro serialization
☆12Aug 21, 2015Updated 10 years ago
gjreda / cy-young-NL-2015
View on GitHub
Using data to dig into the 2015 NL Cy Young race
☆10Nov 19, 2015Updated 10 years ago
TugdualSarazin / spark-clustering
View on GitHub
Some Spark implementations of clustering algorithms.
☆19Nov 13, 2018Updated 7 years ago
lumiata / tech_blog
View on GitHub
Follow the Lumiata Tech Blog on Medium!
☆20May 8, 2023Updated 3 years ago
aws-samples / aws-mobile-self-paced-labs-samples
View on GitHub
☆15Jan 25, 2018Updated 8 years ago
datadudes / salesforce2hadoop
View on GitHub
Import Salesforce data into Hadoop HDFS in Avro format
☆23Jan 8, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
databricks / spark-knowledgebase
View on GitHub
Spark Knowledge Base
☆333Oct 1, 2020Updated 5 years ago
DisruptiveLabs / lambda-thumbnailer
View on GitHub
AWS Lambda thumbnailer with support for GIFs and PDFs (first page/frame)
☆14Feb 17, 2016Updated 10 years ago
ofermend / IPython-notebooks
View on GitHub
Some IPython notebooks I've created...
☆29Mar 17, 2016Updated 10 years ago
seanpquig / confluent-platform-spark-streaming
View on GitHub
Working example of consuming Avro data from Kafka with Spark Streaming
☆12Feb 21, 2016Updated 10 years ago
mrsqueeze / spark-hash
View on GitHub
Locality Sensitive Hashing for Apache Spark
☆198Nov 1, 2016Updated 9 years ago
pmerienne / iterative-cf
View on GitHub
storm/trident based highly scalable recommendation engine
☆17Jun 25, 2013Updated 13 years ago
kickback / time_series_with_python
View on GitHub
☆10Sep 16, 2016Updated 9 years ago