BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
☆2,693 · Feb 5, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for BigDL
Users who are interested in BigDL compare it to the libraries listed below.
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,… ☆8,694 · Jan 28, 2026 · Updated last month
- Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep le… ☆695 · Nov 12, 2024 · Updated last year
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,858 · Jul 10, 2023 · Updated 2 years ago
- Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform. ☆3,623 · Jun 7, 2024 · Updated last year
- Simple and Distributed Machine Learning ☆5,200 · Feb 14, 2026 · Updated 2 weeks ago
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,784 · Oct 13, 2025 · Updated 4 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,675 · Dec 1, 2025 · Updated 2 months ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,272 · Sep 29, 2023 · Updated 2 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆368 · Feb 1, 2026 · Updated last month
- MLeap: Deploy ML Pipelines to Production ☆1,535 · Jan 12, 2026 · Updated last month
- Brings SQL and AI together. ☆5,191 · Apr 18, 2024 · Updated last year
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI. ☆1,845 · May 29, 2024 · Updated last year
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆41,516 · Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,602 · Feb 21, 2026 · Updated last week
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, … ☆24,365 · Updated this week
- A high performance and generic framework for distributed DNN training ☆3,716 · Oct 3, 2023 · Updated 2 years ago
- Step-by-step Deep Learning Tutorials on Apache Spark using BigDL ☆210 · Jan 3, 2023 · Updated 3 years ago
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆7,157 · Apr 29, 2025 · Updated 10 months ago
- An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model c… ☆14,339 · Jul 3, 2024 · Updated last year
- State of the Art Natural Language Processing ☆4,109 · Updated this week
- Machine Learning Toolkit for Kubernetes ☆15,462 · Jan 5, 2026 · Updated last month
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,879 · Jan 2, 2026 · Updated last month
- Upserts, Deletes And Incremental Processing on Big Data. ☆6,098 · Updated this week
- Resource scheduling and cluster management for AI ☆2,687 · Jun 6, 2024 · Updated last year
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,304 · Updated this week
- Distributed Deep learning with Keras & Spark ☆1,578 · May 1, 2023 · Updated 2 years ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F… ☆7,500 · Updated this week
- Submarine is Cloud Native Machine Learning Platform. ☆705 · Apr 3, 2024 · Updated last year
- AI on Hadoop ☆1,732 · Jul 1, 2025 · Updated 7 months ago
- High performance data store solution ☆1,446 · Feb 21, 2026 · Updated last week
- AutoML library for deep learning ☆9,309 · Nov 25, 2025 · Updated 3 months ago
- An open source python library for automated feature engineering ☆7,614 · Feb 3, 2026 · Updated 3 weeks ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,730 · Feb 16, 2026 · Updated last week
- An Industrial Grade Federated Learning Framework ☆6,048 · Nov 19, 2024 · Updated last year
- Apache Flink ☆25,825 · Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,362 · Feb 10, 2026 · Updated 2 weeks ago
- REST job server for Apache Spark ☆2,842 · Jul 8, 2025 · Updated 7 months ago
- Read and write Tensorflow TFRecord data from Apache Spark. ☆297 · Apr 22, 2024 · Updated last year
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆28,035 · Updated this week