tony-framework/TonY

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.

☆708

Alternatives and similar repositories for TonY

Users who are interested in TonY compare it with the repositories listed below.

yahoo / TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
☆3,846 · Jul 10, 2023 · Updated 3 years ago
tensorflow / ecosystem
Integration of TensorFlow with other open-source frameworks
☆1,378 · Sep 25, 2024 · Updated last year
linkedin / Avro2TF
Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.
☆129 · May 9, 2020 · Updated 6 years ago
apache / submarine
Submarine is Cloud Native Machine Learning Platform.
☆706 · Apr 3, 2024 · Updated 2 years ago
apache / yunikorn-core
Apache YuniKorn Core
☆1,021 · Updated this week
linkedin / dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
☆1,370 · Aug 22, 2023 · Updated 2 years ago
Qihoo360 / hbox
AI on Hadoop
☆1,729 · Jul 1, 2025 · Updated last year
linkedin / spark-tfrecord
Read and write Tensorflow TFRecord data from Apache Spark.
☆300 · Apr 22, 2024 · Updated 2 years ago
linkedin / photon-ml
A scalable machine learning library on Apache Spark
☆797 · Aug 30, 2021 · Updated 4 years ago
Angel-ML / angel
A Flexible and Powerful Parameter Server for large-scale machine learning
☆6,784 · Jun 8, 2026 · Updated last month
horovod / horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
☆14,694 · Jun 20, 2026 · Updated last month
linkedin / kube2hadoop
Secure HDFS Access from Kubernetes
☆61 · Jun 11, 2020 · Updated 6 years ago
criteo / tf-yarn
Train TensorFlow models on YARN in just a few lines of code!
☆93 · Nov 3, 2023 · Updated 2 years ago
databricks / spark-deep-learning
Deep Learning Pipelines for Apache Spark
☆1,989 · Mar 30, 2023 · Updated 3 years ago
bytedance / byteps
A high performance and generic framework for distributed DNN training
☆3,718 · Oct 3, 2023 · Updated 2 years ago
jcrist / skein
A tool and library for easily deploying applications on Apache YARN
☆145 · Mar 12, 2024 · Updated 2 years ago
linkedin / dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
☆135 · Jan 11, 2024 · Updated 2 years ago
alibaba / x-deeplearning
An industrial deep learning framework for high-dimension sparse data
☆4,301 · Sep 25, 2024 · Updated last year
uber / petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…
☆1,888 · Jan 2, 2026 · Updated 6 months ago
salesforce / TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…
☆2,277 · Jun 2, 2026 · Updated last month
kubeflow / kubeflow
Machine Learning Toolkit for Kubernetes
☆15,788 · Jul 10, 2026 · Updated last week
microsoft / SynapseML
Simple and Distributed Machine Learning
☆5,233 · Jul 6, 2026 · Updated 2 weeks ago
intel / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…
☆8,866 · Jan 28, 2026 · Updated 5 months ago
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907 · Updated this week
sql-machine-learning / elasticdl
Kubernetes-native Deep Learning Framework
☆744 · Jan 26, 2024 · Updated 2 years ago
dmlc / difacto
Distributed Factorization Machines
☆299 · Mar 23, 2016 · Updated 10 years ago
dmlc / ps-lite
A lightweight parameter server interface
☆1,561 · Mar 2, 2026 · Updated 4 months ago
flink-extended / dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le…
☆693 · Nov 12, 2024 · Updated last year
Alluxio / alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,213 · Apr 29, 2025 · Updated last year
alibaba / Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
☆3,610 · Jun 7, 2024 · Updated 2 years ago
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353 · Updated this week
tensorflow / serving
A flexible, high-performance serving system for machine learning models
☆6,356 · Updated this week
combust / mleap
MLeap: Deploy ML Pipelines to Production
☆1,539 · Updated this week
linkedin / spark
Apache Spark - A unified analytics engine for large-scale data processing
☆16 · Jul 24, 2023 · Updated 2 years ago
yaooqinn / spark-authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…
☆183 · Apr 6, 2022 · Updated 4 years ago
Qihoo360 / tensornet
a TensorFlow-based distributed training framework optimized for large-scale sparse data.
☆333 · Apr 10, 2026 · Updated 3 months ago
criteo / babar
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆129 · Sep 7, 2018 · Updated 7 years ago
qiaoguan / deep-ctr-prediction
CTR prediction models based on deep learning (deep-learning-based CTR estimation models for ad recommendation)
☆938 · Nov 15, 2019 · Updated 6 years ago
microsoft / pai
Resource scheduling and cluster management for AI
☆2,685 · Jun 6, 2024 · Updated 2 years ago