cuDF - GPU DataFrame Library
☆9,498 · Updated this week
Alternatives and similar repositories for cudf
Users who are interested in cudf are comparing it to the libraries listed below.
- cuML - RAPIDS Machine Learning Library · ☆5,122 · Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code · ☆10,362 · Feb 10, 2026 · Updated 2 weeks ago
- NumPy & SciPy for GPU · ☆10,804 · Updated this week
- Parallel computing with task scheduling · ☆13,746 · Feb 22, 2026 · Updated last week
- cuGraph - RAPIDS Graph Analytics Library · ☆2,128 · Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more · ☆34,940 · Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… · ☆8,475 · Feb 5, 2026 · Updated 3 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. · ☆41,516 · Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust · ☆37,513 · Feb 23, 2026 · Updated last week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics · ☆16,543 · Updated this week
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, … · ☆24,485 · Updated this week
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. · ☆2,007 · Sep 16, 2022 · Updated 3 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. · ☆14,675 · Dec 1, 2025 · Updated 3 months ago
- NumPy aware dynamic Python compiler using LLVM · ☆10,921 · Feb 20, 2026 · Updated last week
- 🦉 Data Versioning and ML Experiments · ☆15,404 · Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. · ☆30,860 · Feb 21, 2026 · Updated last week
- the portable Python dataframe library · ☆6,417 · Updated this week
- A library for efficient similarity search and clustering of dense vectors. · ☆39,195 · Updated this week
- A hyperparameter optimization framework · ☆13,583 · Updated this week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… · ☆28,035 · Updated this week
- A game theoretic approach to explain the output of any machine learning model. · ☆25,072 · Feb 20, 2026 · Updated last week
- An open source python library for automated feature engineering · ☆7,614 · Feb 3, 2026 · Updated 3 weeks ago
- Build, Manage and Deploy AI/ML Systems · ☆9,863 · Updated this week
- Streamlit — A faster way to build and share data apps. · ☆43,634 · Updated this week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … · ☆18,103 · Updated this week
- A scikit-learn compatible neural network library that wraps PyTorch · ☆6,152 · Updated this week
- Development repository for the Triton language and compiler · ☆18,501 · Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. · ☆21,697 · Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. · ☆41,706 · Updated this week
- Automatic extraction of relevant features from time series: · ☆9,127 · Nov 15, 2025 · Updated 3 months ago
- 📚 Parameterize, execute, and analyze notebooks · ☆6,388 · Jan 5, 2026 · Updated last month
- Declarative visualization library for Python · ☆10,276 · Updated this week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth. · ☆20,039 · Feb 20, 2026 · Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames. · ☆13,399 · Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… · ☆10,768 · Updated this week
- Hummingbird compiles trained ML models into tensor computation for faster inference. · ☆3,529 · Jul 17, 2025 · Updated 7 months ago
- Deep universal probabilistic programming with Python and PyTorch · ☆8,983 · Jul 9, 2025 · Updated 7 months ago
- Hydra is a framework for elegantly configuring complex applications · ☆10,231 · Feb 7, 2026 · Updated 3 weeks ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk · ☆14,169 · Oct 29, 2025 · Updated 4 months ago