xorbitsai / xorbits
Scalable Python DS & ML, in an API compatible & lightning fast way.
☆1,175 Updated 3 weeks ago
Alternatives and similar repositories for xorbits:
Users who are interested in xorbits are comparing it to the libraries listed below.
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions. ☆2,722 Updated last year
- 🏕️ Reproducible development environment ☆2,109 Updated this week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆331 Updated 2 weeks ago
- Easy to use Python library of customized functions for cleaning and analyzing data. ☆510 Updated 3 months ago
- Python implementation of the parquet columnar file format. ☆824 Updated last month
- A @ClickHouse fork that supports high-performance vector search and full-text search. ☆952 Updated 2 months ago
- Extended pickling support for Python objects ☆1,743 Updated 3 weeks ago
- Fast NumPy array functions written in C ☆1,107 Updated 2 weeks ago
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute ☆629 Updated last year
- Distributed SQL Engine in Python using Dask ☆402 Updated 7 months ago
- A specification that Python filesystems should adhere to. ☆1,146 Updated 3 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,072 Updated 3 weeks ago
- A Python package for manipulating 2-dimensional tabular data structures ☆1,838 Updated last month
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,116 Updated this week
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,219 Updated last week
- A high-performance ML model serving framework that offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine ☆839 Updated last week
- A Python package for easy multiprocessing, but faster than multiprocessing ☆2,056 Updated 8 months ago
- Manipulate JSON-like data with NumPy-like idioms. ☆875 Updated this week
- Python package for statistical data animations ☆359 Updated last year
- Apache DataFusion Python Bindings ☆441 Updated last week
- More styles and useful extensions for Matplotlib ☆812 Updated 2 years ago
- A JupyterLab extension for displaying cell timings ☆383 Updated 4 months ago
- Distributed XGBoost on Ray ☆148 Updated 9 months ago
- Easy, fast, and cheap pretraining, fine-tuning, and serving for everyone ☆293 Updated 2 weeks ago
- EvalML is an AutoML library written in Python. ☆805 Updated last week
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆1,960 Updated 2 years ago
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v… ☆4,507 Updated this week
- It is a high-performance causal inference (statistical model) computing library based on OLAP, which solves the performance bottleneck of… ☆148 Updated 4 months ago
- Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code. ☆2,151 Updated 9 months ago
- A Survey of AI startups ☆397 Updated last year