ShifuML/shifu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ShifuML/shifu)

ShifuML / shifu

An end-to-end machine learning and data mining framework on Hadoop

☆258

Alternatives and similar repositories for shifu

Users that are interested in shifu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShifuML / guagua
View on GitHub
An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.
☆75May 20, 2022Updated 4 years ago
EpistasisLab / ml-analyst
View on GitHub
Analysis pipeline for quick ML analyses.
☆11Oct 4, 2018Updated 7 years ago
jpmml / jpmml-xgboost
View on GitHub
Java library and command-line application for converting XGBoost models to PMML
☆131May 25, 2026Updated last month
alipay / jpmml-sparkml-lightgbm
View on GitHub
JPMML-SparkML plugin for converting LightGBM-Spark models to PMML
☆44Oct 23, 2021Updated 4 years ago
linkedin / photon-ml
View on GitHub
A scalable machine learning library on Apache Spark
☆797Aug 30, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kanyun-inc / ytk-mp4j
View on GitHub
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…
☆112Jun 14, 2017Updated 9 years ago
flink-china / community
View on GitHub
Flink China 社区介绍、参与指南
☆10Dec 26, 2018Updated 7 years ago
openscoring / openscoring
View on GitHub
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
☆588Feb 2, 2026Updated 5 months ago
runfriends / GettingStartedWithStorm-cn
View on GitHub
翻译Getting Started with Storm
☆42Nov 14, 2014Updated 11 years ago
jpmml / jpmml-evaluator
View on GitHub
Java Evaluator API for PMML
☆905Feb 1, 2026Updated 5 months ago
leigu / brave-tracer-example
View on GitHub
☆13Oct 28, 2015Updated 10 years ago
hhxx2015 / MyLR
View on GitHub
java简单实现一些机器学习算法
☆13Dec 10, 2020Updated 5 years ago
Angel-ML / angel
View on GitHub
A Flexible and Powerful Parameter Server for large-scale machine learning
☆6,781Jun 8, 2026Updated last month
intel-machine-learning / DistML
View on GitHub
DistML provide a supplement to mllib to support model-parallel on Spark
☆170Feb 6, 2017Updated 9 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
alexholmes / hsync
View on GitHub
HDFS rsync-like utility to replicate data between HDFS clusters
☆17Jun 16, 2012Updated 14 years ago
pmerienne / trident-ml
View on GitHub
Trident-ML : A realtime online machine learning library
☆384Dec 16, 2023Updated 2 years ago
Angel-ML / automl
View on GitHub
An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering.
☆61Oct 29, 2019Updated 6 years ago
amplab / MLI
View on GitHub
An API for Distributed Machine Learning
☆156Sep 22, 2016Updated 9 years ago
kanyun-inc / ytk-learn
View on GitHub
Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logi…
☆350Jul 6, 2022Updated 4 years ago
kekingcn / kk-erm-maven-plugin
View on GitHub
将erm关系描述文件生成JPA实体Entity的maven插件
☆12Oct 24, 2018Updated 7 years ago
lucidworks / solr-for-datascience
View on GitHub
☆24Oct 19, 2015Updated 10 years ago
hsperr / first_steps_in_scala
View on GitHub
☆14Aug 23, 2015Updated 10 years ago
OryxProject / oryx
View on GitHub
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783Aug 16, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
codlife / SparkDeepDoc
View on GitHub
Some deep resources from apache spark, cloudera, my practice and so on. Most important is what i think.
☆13Dec 24, 2016Updated 9 years ago
Wheest / pytorch-lightning-cifar
View on GitHub
Common CNN models defined for PyTorch Lightning
☆10Jul 28, 2022Updated 3 years ago
lambdaji / DarwinAccelerator
View on GitHub
Google分层实验框架
☆33Jul 24, 2018Updated 7 years ago
CASISCAS / asyspark
View on GitHub
Asynchronous spark machine learning with parameter server
☆25Sep 27, 2016Updated 9 years ago
jpmml / jpmml-sparkml
View on GitHub
Java library and command-line application for converting Apache Spark ML pipelines to PMML
☆271Jun 9, 2026Updated last month
simplesteph / kafka-0.11-examples
View on GitHub
Code snippets that demonstrate how to leverage the new Kafka 0.11 APIs
☆17Aug 19, 2017Updated 8 years ago
tobegit3hub / tensorflow_capsnet
View on GitHub
TensorFlow implementation of CapsNet
☆10Apr 3, 2020Updated 6 years ago
fanruan / intelli-swift-core
View on GitHub
Distributed, Column-oriented storage, Realtime analysis, High performance Database
☆20Oct 3, 2024Updated last year
rayokota / hgraphdb
View on GitHub
HBase as a TinkerPop Graph Database
☆264Apr 29, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TalkingData / Fregata
View on GitHub
A light weight, super fast, large scale machine learning library on spark .
☆677Mar 23, 2018Updated 8 years ago
SOHUDBA / mybus
View on GitHub
MySQL数据库同redis以及hbase高速全量，增量同步工具
☆14Aug 27, 2015Updated 10 years ago
huawei-noah / streamDM
View on GitHub
Stream Data Mining Library for Spark Streaming
☆497Apr 16, 2023Updated 3 years ago
haoch / flink-siddhi
View on GitHub
A CEP library to run Siddhi within Apache Flink™ Streaming Application (Not maintained)
☆247Dec 16, 2023Updated 2 years ago
sunbow1 / SparkMLlibDeepLearn
View on GitHub
SparkMLlibDeepLearn深度学习
☆209Aug 3, 2015Updated 10 years ago
cnkuangshi / LightCTR
View on GitHub
Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosop…
☆669Jun 17, 2019Updated 7 years ago
zeroc-ice / datastorm
View on GitHub
Data centric pub/sub framework based on Ice
☆13Oct 15, 2024Updated last year