TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
☆713Oct 14, 2023Updated 2 years ago
Alternatives and similar repositories for TonY
Users that are interested in TonY are comparing it to the libraries listed below
Sorting:
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,858Jul 10, 2023Updated 2 years ago
- Integration of TensorFlow with other open-source frameworks☆1,374Sep 25, 2024Updated last year
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.☆128May 9, 2020Updated 5 years ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆298Apr 22, 2024Updated last year
- Apache YuniKorn Core☆1,002Feb 24, 2026Updated last week
- AI on Hadoop☆1,732Jul 1, 2025Updated 8 months ago
- Submarine is Cloud Native Machine Learning Platform.☆705Apr 3, 2024Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,371Aug 22, 2023Updated 2 years ago
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,784Oct 13, 2025Updated 4 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,675Dec 1, 2025Updated 3 months ago
- Train TensorFlow models on YARN in just a few lines of code!☆93Nov 3, 2023Updated 2 years ago
- A scalable machine learning library on Apache Spark☆796Aug 30, 2021Updated 4 years ago
- A high performance and generic framework for distributed DNN training☆3,716Oct 3, 2023Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆146Mar 12, 2024Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,879Jan 2, 2026Updated 2 months ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,272Sep 29, 2023Updated 2 years ago
- An industrial deep learning framework for high-dimension sparse data☆4,307Sep 25, 2024Updated last year
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆134Jan 11, 2024Updated 2 years ago
- MLeap: Deploy ML Pipelines to Production☆1,535Jan 12, 2026Updated last month
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.☆128Sep 7, 2018Updated 7 years ago
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,694Jan 28, 2026Updated last month
- Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.☆3,623Jun 7, 2024Updated last year
- Secure HDFS Access from Kubernetes☆61Jun 11, 2020Updated 5 years ago
- Machine Learning Toolkit for Kubernetes☆15,482Jan 5, 2026Updated last month
- Brings SQL and AI together.☆5,191Apr 18, 2024Updated last year
- Resource scheduling and cluster management for AI☆2,687Jun 6, 2024Updated last year
- Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le…☆695Nov 12, 2024Updated last year
- Simple and Distributed Machine Learning☆5,201Feb 14, 2026Updated 2 weeks ago
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,157Apr 29, 2025Updated 10 months ago
- Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logi…☆350Jul 6, 2022Updated 3 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Dec 5, 2022Updated 3 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,308Updated this week
- SQL-based streaming analytics platform at scale☆1,226Jun 21, 2020Updated 5 years ago
- Kubernetes-native Deep Learning Framework☆746Jan 26, 2024Updated 2 years ago
- A lightweight parameter server interface☆1,560Jan 11, 2023Updated 3 years ago
- A flexible, high-performance serving system for machine learning models☆6,350Dec 18, 2025Updated 2 months ago
- Distributed Deep learning with Keras & Spark☆1,578May 1, 2023Updated 2 years ago
- Pravega - Streaming as a new software defined storage primitive☆2,005Mar 2, 2025Updated last year
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,692Updated this week