An end-to-end machine learning and data mining framework on Hadoop
☆258May 13, 2024Updated 2 years ago
Alternatives and similar repositories for shifu
Users that are interested in shifu are comparing it to the libraries listed below.
- An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.☆75May 20, 2022Updated 4 years ago
- Analysis pipeline for quick ML analyses.☆11Oct 4, 2018Updated 7 years ago
- Java library and command-line application for converting XGBoost models to PMML☆131Apr 26, 2026Updated last month
- JPMML-SparkML plugin for converting LightGBM-Spark models to PMML☆43Oct 23, 2021Updated 4 years ago
- A scalable machine learning library on Apache Spark☆797Aug 30, 2021Updated 4 years ago
- Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…☆112Jun 14, 2017Updated 8 years ago
- REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models☆588Feb 2, 2026Updated 3 months ago
- A translation of Getting Started with Storm☆42Nov 14, 2014Updated 11 years ago
- Java Evaluator API for PMML☆903Feb 1, 2026Updated 3 months ago
- ☆13Oct 28, 2015Updated 10 years ago
- Simple Java implementations of some machine learning algorithms☆13Dec 10, 2020Updated 5 years ago
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,789May 8, 2026Updated 3 weeks ago
- DistML provides a supplement to MLlib to support model parallelism on Spark☆170Feb 6, 2017Updated 9 years ago
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- Trident-ML : A realtime online machine learning library☆384Dec 16, 2023Updated 2 years ago
- An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering.☆61Oct 29, 2019Updated 6 years ago
- class project for cs263, Spring 2018☆12Jun 13, 2018Updated 7 years ago
- An API for Distributed Machine Learning☆156Sep 22, 2016Updated 9 years ago
- Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logi…☆350Jul 6, 2022Updated 3 years ago
- Maven plugin that generates JPA entity classes from ERM relationship description files☆12Oct 24, 2018Updated 7 years ago
- ☆24Oct 19, 2015Updated 10 years ago
- Leon Bottou's SGD☆34Oct 13, 2011Updated 14 years ago
- Lightweight Java framework for real-time business risk control systems☆767May 28, 2025Updated last year
- ☆14Aug 23, 2015Updated 10 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Aug 16, 2021Updated 4 years ago
- Some in-depth resources from Apache Spark, Cloudera, my own practice, and so on. Most important is what I think.☆13Dec 24, 2016Updated 9 years ago
- Common CNN models defined for PyTorch Lightning☆10Jul 28, 2022Updated 3 years ago
- Google layered experiment framework☆33Jul 24, 2018Updated 7 years ago
- Asynchronous spark machine learning with parameter server☆25Sep 27, 2016Updated 9 years ago
- Distributed Tensorflow best practices template using Tensorflow Estimator API☆17Mar 19, 2019Updated 7 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- This shows how to embed Hystrix in a non-invasive manner into existing Spring applications.☆24May 5, 2014Updated 12 years ago
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆271Apr 11, 2026Updated last month
- TensorFlow implementation of CapsNet☆10Apr 3, 2020Updated 6 years ago
- Distributed, Column-oriented storage, Realtime analysis, High performance Database☆20Oct 3, 2024Updated last year
- HBase as a TinkerPop Graph Database☆264Apr 29, 2026Updated last month
- A lightweight, super fast, large-scale machine learning library on Spark.☆677Mar 23, 2018Updated 8 years ago
- High-speed full and incremental synchronization tool between MySQL and Redis/HBase☆14Aug 27, 2015Updated 10 years ago
- Stream Data Mining Library for Spark Streaming☆497Apr 16, 2023Updated 3 years ago