mars-project / mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
☆2,708Updated last year
Alternatives and similar repositories for mars:
Users that are interested in mars are comparing it to the libraries listed below
- A high performance and generic framework for distributed DNN training☆3,655Updated last year
- Scalable Python DS & ML, in an API compatible & lightning fast way.☆1,139Updated this week
- 腾讯高性能分布式图计算框架Plato☆1,902Updated 3 years ago
- Parallel computing with task scheduling☆12,851Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,810Updated last year
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆845Updated 2 weeks ago
- A distributed task scheduler for Dask☆1,589Updated this week
- Kubernetes-native Deep Learning Framework☆733Updated 11 months ago
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.☆1,941Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,321Updated 3 months ago
- Extended pickling support for Python objects☆1,688Updated this week
- An open source python library for automated feature engineering☆7,334Updated this week
- Universal model exchange and serialization format for decision tree forests☆750Updated this week
- A Python toolbox for performing gradient-free optimization☆3,984Updated last month
- Computing with Python functions.☆3,938Updated this week
- Distributed Computing for AI Made Simple☆1,041Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python☆2,315Updated 2 years ago
- Bagua Speeds up PyTorch☆877Updated 5 months ago
- A distributed graph deep learning framework.☆2,901Updated last year
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,838Updated 7 months ago
- Collective communications library with various primitives for multi-machine training.☆1,253Updated 2 weeks ago
- ThunderGBM: Fast GBDTs and Random Forests on GPUs☆694Updated 11 months ago
- Scalable Machine Learning with Dask☆912Updated last month
- Adaptive Experimentation Platform☆2,406Updated this week
- parallel graph management and execution in heterogeneous computing☆1,415Updated 2 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆34,903Updated this week
- An Industrial Graph Neural Network Framework☆1,293Updated 6 months ago
- Resource scheduling and cluster management for AI☆2,646Updated 7 months ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,306Updated 3 months ago
- Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more☆2,266Updated last month