TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
☆710 · Oct 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for TonY
Users that are interested in TonY are comparing it to the libraries listed below.
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,858 · Jul 10, 2023 · Updated 2 years ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks. ☆128 · May 9, 2020 · Updated 5 years ago
- Integration of TensorFlow with other open-source frameworks ☆1,374 · Sep 25, 2024 · Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,369 · Aug 22, 2023 · Updated 2 years ago
- Submarine is a Cloud Native Machine Learning Platform. ☆705 · Apr 3, 2024 · Updated last year
- Apache YuniKorn Core ☆1,004 · Updated this week
- AI on Hadoop ☆1,729 · Jul 1, 2025 · Updated 8 months ago
- Read and write TensorFlow TFRecord data from Apache Spark. ☆298 · Apr 22, 2024 · Updated last year
- A scalable machine learning library on Apache Spark ☆796 · Aug 30, 2021 · Updated 4 years ago
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,784 · Oct 13, 2025 · Updated 5 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,679 · Dec 1, 2025 · Updated 3 months ago
- Secure HDFS Access from Kubernetes ☆61 · Jun 11, 2020 · Updated 5 years ago
- Train TensorFlow models on YARN in just a few lines of code! ☆93 · Nov 3, 2023 · Updated 2 years ago
- A high performance and generic framework for distributed DNN training ☆3,716 · Oct 3, 2023 · Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN ☆146 · Mar 12, 2024 · Updated 2 years ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode. ☆134 · Jan 11, 2024 · Updated 2 years ago
- An industrial deep learning framework for high-dimension sparse data ☆4,308 · Sep 25, 2024 · Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,879 · Jan 2, 2026 · Updated 2 months ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,272 · Sep 29, 2023 · Updated 2 years ago
- Machine Learning Toolkit for Kubernetes ☆15,527 · Jan 5, 2026 · Updated 2 months ago
- Kubernetes-native Deep Learning Framework ☆744 · Jan 26, 2024 · Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆893 · Mar 10, 2026 · Updated last week
- Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le… ☆694 · Nov 12, 2024 · Updated last year
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,… ☆8,732 · Jan 28, 2026 · Updated last month
- A lightweight parameter server interface ☆1,561 · Mar 2, 2026 · Updated 3 weeks ago
- Distributed Factorization Machines ☆299 · Mar 23, 2016 · Updated 10 years ago
- Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform. ☆3,619 · Jun 7, 2024 · Updated last year
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆7,167 · Apr 29, 2025 · Updated 10 months ago
- MLeap: Deploy ML Pipelines to Production ☆1,535 · Mar 10, 2026 · Updated last week
- Apache Spark - A unified analytics engine for large-scale data processing ☆16 · Jul 24, 2023 · Updated 2 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,311 · Updated this week
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 · Apr 6, 2022 · Updated 3 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN. ☆129 · Sep 7, 2018 · Updated 7 years ago
- Simple and Distributed Machine Learning ☆5,215 · Updated this week
- A TensorFlow-based distributed training framework optimized for large-scale sparse data. ☆333 · Dec 23, 2025 · Updated 3 months ago
- Resource scheduling and cluster management for AI ☆2,682 · Jun 6, 2024 · Updated last year
- Brings SQL and AI together. ☆5,189 · Apr 18, 2024 · Updated last year
- CTR prediction models based on deep learning (deep-learning-based CTR estimation models for ad recommendation) ☆934 · Nov 15, 2019 · Updated 6 years ago
- SQL-based streaming analytics platform at scale ☆1,225 · Jun 21, 2020 · Updated 5 years ago