Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
☆2,747Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for mars
Users that are interested in mars are comparing it to the libraries listed below
Sorting:
- Scalable Python DS & ML, in an API compatible & lightning fast way.☆1,203Feb 14, 2026Updated last month
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated last month
- Parallel computing with task scheduling☆13,765Mar 12, 2026Updated last week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,799Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,492Mar 1, 2026Updated 3 weeks ago
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆945Jan 22, 2026Updated last month
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,679Dec 1, 2025Updated 3 months ago
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,347Jul 3, 2024Updated last year
- cuDF - GPU DataFrame Library☆9,558Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,108Updated this week
- A high performance and generic framework for distributed DNN training☆3,716Oct 3, 2023Updated 2 years ago
- An open source python library for automated feature engineering☆7,623Feb 3, 2026Updated last month
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.☆2,008Sep 16, 2022Updated 3 years ago
- Brings SQL and AI together.☆5,189Apr 18, 2024Updated last year
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆28,138Updated this week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆370Updated this week
- An industrial deep learning framework for high-dimension sparse data☆4,308Sep 25, 2024Updated last year
- NumPy & SciPy for GPU☆10,847Mar 14, 2026Updated last week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a…☆24,874Updated this week
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,784Oct 13, 2025Updated 5 months ago
- Open Machine Learning Compiler Framework☆13,197Updated this week
- High-performance runtime for data analytics applications☆3,002Jun 22, 2022Updated 3 years ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,597Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆39,403Updated this week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆18,165Updated this week
- AutoML library for deep learning☆9,310Nov 25, 2025Updated 3 months ago
- A Python package for manipulating 2-dimensional tabular data structures☆1,882Mar 17, 2025Updated last year
- Low-code framework for building custom LLMs, neural networks, and other AI models☆11,657Updated this week
- Build, Manage and Deploy AI/ML Systems☆9,956Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,079Updated this week
- cuML - RAPIDS Machine Learning Library☆5,148Updated this week
- Automated Machine Learning with scikit-learn☆8,072Jan 20, 2026Updated 2 months ago
- Python package built to ease deep learning on graph, on top of existing DL frameworks.☆14,245Jul 31, 2025Updated 7 months ago
- 🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统☆3,538Mar 9, 2026Updated last week
- A natural language modeling framework based on PyTorch☆6,306Oct 17, 2022Updated 3 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,823Oct 25, 2023Updated 2 years ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,531Jul 17, 2025Updated 8 months ago
- Generate embeddings from large-scale graph-structured data.☆3,459Mar 3, 2024Updated 2 years ago
- Fast and flexible AutoML with learning guarantees.☆3,456Nov 30, 2023Updated 2 years ago