Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
☆2,749 · Jan 2, 2024 · Updated 2 years ago
Alternatives and similar repositories for mars
Users who are interested in mars compare it to the libraries listed below.
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,362 · Feb 10, 2026 · Updated 2 weeks ago
- Parallel computing with task scheduling ☆13,746 · Feb 22, 2026 · Updated last week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆41,516 · Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,475 · Feb 5, 2026 · Updated 3 weeks ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,675 · Dec 1, 2025 · Updated 3 months ago
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage) ☆946 · Jan 22, 2026 · Updated last month
- cuDF - GPU DataFrame Library ☆9,498 · Updated this week
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c… ☆14,339 · Jul 3, 2024 · Updated last year
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆34,940 · Updated this week
- An open source python library for automated feature engineering ☆7,614 · Feb 3, 2026 · Updated 3 weeks ago
- A high performance and generic framework for distributed DNN training ☆3,716 · Oct 3, 2023 · Updated 2 years ago
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆2,007 · Sep 16, 2022 · Updated 3 years ago
- Brings SQL and AI together. ☆5,191 · Apr 18, 2024 · Updated last year
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆28,035 · Updated this week
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, … ☆24,485 · Updated this week
- Open Machine Learning Compiler Framework ☆13,142 · Updated this week
- NumPy & SciPy for GPU ☆10,804 · Updated this week
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,784 · Oct 13, 2025 · Updated 4 months ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆368 · Feb 1, 2026 · Updated last month
- An industrial deep learning framework for high-dimension sparse data ☆4,307 · Sep 25, 2024 · Updated last year
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … ☆18,103 · Updated this week
- Low-code framework for building custom LLMs, neural networks, and other AI models ☆11,651 · Updated this week
- AutoML library for deep learning ☆9,308 · Nov 25, 2025 · Updated 3 months ago
- High-performance runtime for data analytics applications ☆3,005 · Jun 22, 2022 · Updated 3 years ago
- A Python package for manipulating 2-dimensional tabular data structures ☆1,883 · Mar 17, 2025 · Updated 11 months ago
- Build, Manage and Deploy AI/ML Systems ☆9,863 · Updated this week
- cuML - RAPIDS Machine Learning Library ☆5,122 · Updated this week
- A library for efficient similarity search and clustering of dense vectors. ☆39,195 · Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆16,543 · Updated this week
- Python package built to ease deep learning on graph, on top of existing DL frameworks. ☆14,238 · Jul 31, 2025 · Updated 7 months ago
- Automated Machine Learning with scikit-learn ☆8,058 · Jan 20, 2026 · Updated last month
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. ☆10,049 · Sep 11, 2025 · Updated 5 months ago
- A Python toolbox for performing gradient-free optimization ☆4,151 · May 30, 2025 · Updated 9 months ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,829 · Oct 25, 2023 · Updated 2 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,879 · Jan 2, 2026 · Updated last month
- Deep universal probabilistic programming with Python and PyTorch ☆8,983 · Jul 9, 2025 · Updated 7 months ago
- 🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | One-stop graph computing system ☆3,535 · Feb 20, 2026 · Updated last week
- Machine Learning Toolkit for Kubernetes ☆15,462 · Jan 5, 2026 · Updated last month
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,860 · Feb 21, 2026 · Updated last week