Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
☆2,744 Jan 2, 2024 Updated 2 years ago
Alternatives and similar repositories for mars
Users that are interested in mars are comparing it to the libraries listed below.
- Scalable Python DS & ML, in an API compatible & lightning fast way. ☆1,204 Feb 14, 2026 Updated 2 months ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,382 Feb 10, 2026 Updated 2 months ago
- Parallel computing with task scheduling ☆13,807 Apr 22, 2026 Updated last week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆42,373 Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,501 Apr 1, 2026 Updated last month
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage) ☆951 Jan 22, 2026 Updated 3 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,692 Dec 1, 2025 Updated 4 months ago
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c… ☆14,351 Jul 3, 2024 Updated last year
- cuDF - GPU DataFrame Library ☆9,612 Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆35,484 Updated this week
- A high performance and generic framework for distributed DNN training ☆3,715 Oct 3, 2023 Updated 2 years ago
- An open source python library for automated feature engineering ☆7,630 Feb 3, 2026 Updated 2 months ago
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆2,010 Sep 16, 2022 Updated 3 years ago
- Brings SQL and AI together. ☆5,188 Apr 18, 2024 Updated 2 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆28,324 Apr 24, 2026 Updated last week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆371 Apr 10, 2026 Updated 2 weeks ago
- An industrial deep learning framework for high-dimension sparse data ☆4,304 Sep 25, 2024 Updated last year
- NumPy & SciPy for GPU ☆10,914 Apr 24, 2026 Updated last week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a… ☆25,551 Updated this week
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,787 Oct 13, 2025 Updated 6 months ago
- Open Machine Learning Compiler Framework ☆13,304 Updated this week
- High-performance runtime for data analytics applications ☆3,002 Apr 13, 2026 Updated 2 weeks ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆16,703 Updated this week
- A library for efficient similarity search and clustering of dense vectors. ☆39,856 Updated this week
- AutoML library for deep learning ☆9,313 Nov 25, 2025 Updated 5 months ago
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … ☆18,295 Updated this week
- A Python package for manipulating 2-dimensional tabular data structures ☆1,880 Mar 17, 2025 Updated last year
- Low-code framework for building custom LLMs, neural networks, and other AI models ☆11,680 Updated this week
- Automated Machine Learning with scikit-learn ☆8,089 Apr 21, 2026 Updated last week
- Build, Manage and Deploy AI/ML Systems ☆10,063 Updated this week
- A natural language modeling framework based on PyTorch ☆6,301 Oct 17, 2022 Updated 3 years ago
- A composable and fully extensible C++ execution engine library for data management systems. ☆4,112 Updated this week
- cuML - RAPIDS Machine Learning Library ☆5,181 Updated this week
- Python package built to ease deep learning on graph, on top of existing DL frameworks. ☆14,271 Jul 31, 2025 Updated 9 months ago
- 🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统 ☆3,551 Apr 21, 2026 Updated last week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,810 Oct 25, 2023 Updated 2 years ago
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,534 Jul 17, 2025 Updated 9 months ago
- Generate embeddings from large-scale graph-structured data. ☆3,458 Mar 3, 2024 Updated 2 years ago
- Fast and flexible AutoML with learning guarantees. ☆3,458 Nov 30, 2023 Updated 2 years ago