cuDF - GPU DataFrame Library
☆9,558Mar 18, 2026Updated this week
Alternatives and similar repositories for cudf
Users that are interested in cudf are comparing it to the libraries listed below
Sorting:
- cuML - RAPIDS Machine Learning Library☆5,148Updated this week
- cuGraph - RAPIDS Graph Analytics Library☆2,143Updated this week
- NumPy & SciPy for GPU☆10,847Mar 14, 2026Updated last week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated last month
- Parallel computing with task scheduling☆13,765Mar 12, 2026Updated last week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,108Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,492Mar 1, 2026Updated 3 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,799Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,810Updated this week
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.☆2,008Sep 16, 2022Updated 3 years ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,597Updated this week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a…☆24,874Updated this week
- NumPy aware dynamic Python compiler using LLVM☆10,939Mar 13, 2026Updated last week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,679Dec 1, 2025Updated 3 months ago
- 🦉 Data Versioning and ML Experiments☆15,458Updated this week
- the portable Python dataframe library☆6,457Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,926Mar 10, 2026Updated last week
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆137Jun 27, 2019Updated 6 years ago
- A library for efficient similarity search and clustering of dense vectors.☆39,403Updated this week
- A hyperparameter optimization framework☆13,721Updated this week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆28,138Updated this week
- A game theoretic approach to explain the output of any machine learning model.☆25,131Mar 12, 2026Updated last week
- An open source python library for automated feature engineering☆7,623Feb 3, 2026Updated last month
- Streamlit — A faster way to build and share data apps.☆43,928Updated this week
- Build, Manage and Deploy AI/ML Systems☆9,956Updated this week
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,139Mar 12, 2026Updated last week
- A scikit-learn compatible neural network library that wraps PyTorch☆6,149Feb 25, 2026Updated 3 weeks ago
- Automatic extraction of relevant features from time series:☆9,151Nov 15, 2025Updated 4 months ago
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆18,165Mar 15, 2026Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,869Updated this week
- RAPIDS Memory Manager☆685Mar 14, 2026Updated last week
- Development repository for the Triton language and compiler☆18,708Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,910Updated this week
- Declarative visualization library for Python☆10,305Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆6,400Mar 8, 2026Updated 2 weeks ago
- CUDA-accelerated GIS and spatiotemporal algorithms☆699Jul 28, 2025Updated 7 months ago
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆20,083Mar 2, 2026Updated 2 weeks ago
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,531Jul 17, 2025Updated 8 months ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,438Mar 3, 2026Updated 2 weeks ago