skrusche63/spark-fsm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skrusche63/spark-fsm)

skrusche63 / spark-fsm

This project provides sequential pattern mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and comprise his SPADE and TSR algorithm. This enables to perform sequential pattern and also sequential rule mining.

☆29

Alternatives and similar repositories for spark-fsm

Users that are interested in spark-fsm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

skrusche63 / spark-arules
View on GitHub
This project provides association rule mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and comp…
☆29Mar 10, 2015Updated 11 years ago
mengxr / spark-als
View on GitHub
Another, hopefully better, implementation of ALS on Spark
☆14May 20, 2015Updated 11 years ago
tizfa / sparkboost
View on GitHub
A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark
☆18Jul 7, 2016Updated 10 years ago
skrusche63 / spark-fm
View on GitHub
Reactive Factorization Engine
☆105Feb 18, 2015Updated 11 years ago
skrusche63 / spark-piwik
View on GitHub
Beyond Piwik Analytics with Scala and Apache Spark
☆46Nov 30, 2014Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tresata / spark-scalding
View on GitHub
Use Cascading Taps and Scalding DSL with Spark
☆49Dec 28, 2016Updated 9 years ago
big-data-research / in-memory-data-pipeline
View on GitHub
The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.
☆10Jun 1, 2015Updated 11 years ago
alteryx / sparkGLM
View on GitHub
An R-like GLM package for Apache Spark
☆10Aug 6, 2015Updated 10 years ago
trulia / thoth-ml
View on GitHub
☆15Jan 3, 2015Updated 11 years ago
matroid / matroid-python
View on GitHub
Python bindings for Matroid API
☆18Aug 14, 2025Updated 11 months ago
rstudio / expert
View on GitHub
Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…
☆11Sep 14, 2015Updated 10 years ago
mbartoli / deep-simplification
View on GitHub
Text simplification using RNNs
☆55Mar 31, 2016Updated 10 years ago
lotze / bandit
View on GitHub
R package for split test/one-armed bandit analysis
☆16May 5, 2014Updated 12 years ago
topepo / odsc_rules
View on GitHub
Notes and code for the workshop "Rule-Based Models for Regression and Classification”
☆13May 21, 2016Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
uwescience / myria-web
View on GitHub
Web frontend for Myria
☆12Sep 30, 2020Updated 5 years ago
mtarsel / Django-MOOC
View on GitHub
MOOC Project for software design and development taught at Clarkson University
☆16Feb 11, 2022Updated 4 years ago
sinanuozdemir / sfdat22
View on GitHub
SF DAT 22 Course Repository
☆13Jun 3, 2016Updated 10 years ago
ClaudiuCreanga / hands-on-machine-learning-scikit-learn-tensorflow-oreilly-geron
View on GitHub
Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron
☆10May 11, 2017Updated 9 years ago
skrusche63 / spark-connect
View on GitHub
A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…
☆13Feb 23, 2015Updated 11 years ago
ProjectTw / TwitteR2Mongo
View on GitHub
R Package to stream and analyze tweets using a mongodb
☆13Mar 1, 2016Updated 10 years ago
szilard / dataset-sizes-kdnuggets
View on GitHub
Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Nov 18, 2015Updated 10 years ago
lambdazen / pixy
View on GitHub
Pixy is a declarative vendor-independent graph query language built on the Tinkerpop software stack
☆39Jun 25, 2026Updated 3 weeks ago
tuplejump / embedded-kafka
View on GitHub
Embedded Kafka for testing and quick prototyping.
☆14Apr 19, 2016Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MangoTheCat / Modelling-Airbnb-Prices
View on GitHub
Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)
☆10Feb 5, 2019Updated 7 years ago
vighneshbirodkar / pca
View on GitHub
A comparison of various Robust PCA implementations
☆15Apr 19, 2016Updated 10 years ago
KarrLab / datanator
View on GitHub
Toolkit for discovering and aggregating data for whole-cell modeling
☆15Jan 19, 2022Updated 4 years ago
clips / yarn
View on GitHub
Disambiguating biomedical and clinical concepts with word embeddings
☆15Apr 17, 2018Updated 8 years ago
alipay / hypro_tpp
View on GitHub
PyTorch Implementation of Hybridly Normalized Probabilistic Model for Long-Horizon Prediction of Event Sequence, NeurIPS 2022
☆20Nov 20, 2022Updated 3 years ago
inchara1990 / R-code-Classifiers
View on GitHub
The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, Knn and random forest classifers in a 10 fold …
☆15Mar 13, 2016Updated 10 years ago
adobe / Marketo-SSFS-Service-Provider-Interface
View on GitHub
☆12Apr 8, 2024Updated 2 years ago
BenderV / self_driving_car
View on GitHub
🚗 mini self driving car
☆18Sep 7, 2016Updated 9 years ago
lloydmeta / sparkka-streams
View on GitHub
Power a Spark Stream from anywhere in your Akka Stream Flow
☆12Mar 1, 2016Updated 10 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jhwhite / datasceinceblogs
View on GitHub
Collection of blogs dedicated to Data Science, Analytics, Big Data and other popular Data Science topics
☆17May 14, 2014Updated 12 years ago
guanlan / lua-cmsgpack
View on GitHub
A self contained Lua MessagePack C implementation.
☆15Jul 17, 2013Updated 13 years ago
amplab / velox-modelserver
View on GitHub
☆110Apr 17, 2017Updated 9 years ago
bakins / lua-resty-beanstalkd
View on GitHub
Simple Beanstalkd client for nginx/openresty
☆16Aug 17, 2012Updated 13 years ago
tobert / tobert.github.io
View on GitHub
@MissAmyTobey Writes
☆49Jul 15, 2026Updated last week
stealthly / punxsutawney
View on GitHub
An Apache Mesos Framework that allows for replaying load over and over and over (and over) again
☆10Aug 10, 2015Updated 10 years ago
benoitdancoisne / SparkMaxFlow
View on GitHub
Spark implementation of Ford-Fulkerson algorithm
☆14Feb 11, 2018Updated 8 years ago