xorbitsai / xorbits
Scalable Python DS & ML, in an API compatible & lightning fast way.
☆1,145 Updated this week
Alternatives and similar repositories for xorbits:
Users who are interested in xorbits are comparing it to the libraries listed below.
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions. ☆2,709 Updated last year
- 🏕️ Reproducible development environment ☆2,084 Updated this week
- Python actor framework for heterogeneous computing. ☆132 Updated 2 months ago
- Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applicati… ☆685 Updated 6 months ago
- Fastest library to load data from DB to DataFrames in Rust and Python ☆2,115 Updated this week
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine ☆819 Updated 2 weeks ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,816 Updated last year
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl… ☆809 Updated 10 months ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries. ☆324 Updated last month
- A Python package for easy multiprocessing, but faster than multiprocessing ☆2,039 Updated 6 months ago
- Easy to use Python library of customized functions for cleaning and analyzing data. ☆503 Updated last month
- A Python package for manipulating 2-dimensional tabular data structures ☆1,821 Updated 3 months ago
- EvalML is an AutoML library written in Python. ☆801 Updated this week
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews ☆1,168 Updated this week
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP. ☆4,042 Updated this week
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data. ☆1,081 Updated 7 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,040 Updated 4 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner ☆2,569 Updated 10 months ago
- Universal model exchange and serialization format for decision tree forests ☆750 Updated last month
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,386 Updated 3 weeks ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs ☆3,717 Updated 7 months ago
- OpenFE: automated feature generation with expert-level performance ☆762 Updated 8 months ago
- Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more ☆2,271 Updated last week
- A specification that Python filesystems should adhere to. ☆1,105 Updated 2 weeks ago
- A lightweight version of Milvus ☆298 Updated this week
- Python SDK for Milvus. ☆1,088 Updated this week
- RayLLM - LLMs on Ray ☆1,253 Updated 8 months ago
- Parallel graph management and execution in heterogeneous computing ☆1,420 Updated this week
- YLearn, a pun of "learn why", is a Python package for causal inference ☆410 Updated 5 months ago
- A @ClickHouse fork that supports high-performance vector search and full-text search. ☆920 Updated last week