A Python package for manipulating 2-dimensional tabular data structures
☆1,878Mar 17, 2025Updated last year
Alternatives and similar repositories for datatable
Users that are interested in datatable are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,507Apr 1, 2026Updated last month
- Modin: Scale your Pandas workflows by changing a single line of code☆10,391Feb 10, 2026Updated 3 months ago
- R's data.table package extends data.frame:☆3,890Updated this week
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,640Mar 20, 2024Updated 2 years ago
- A Grammar of Graphics for Python☆4,573May 21, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- cuDF - GPU DataFrame Library☆9,643Updated this week
- reproducible benchmark of database-like ops☆347Jun 29, 2023Updated 2 years ago
- Parallel computing with task scheduling☆13,845May 22, 2026Updated last week
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,756Dec 8, 2025Updated 5 months ago
- the portable Python dataframe library☆6,545May 20, 2026Updated last week
- Lightning Fast Serialization of Data Frames for R☆627Sep 26, 2024Updated last year
- An open source python library for automated feature engineering☆7,650Feb 3, 2026Updated 3 months ago
- Extremely fast Query Engine for DataFrames, written in Rust☆38,571May 22, 2026Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,567Apr 22, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Declarative visualization library for Python☆10,388May 19, 2026Updated last week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,482May 20, 2026Updated last week
- Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.☆2,241Jun 27, 2024Updated last year
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,775Apr 8, 2026Updated last month
- A light-weight, flexible, and expressive statistical data testing library☆4,356May 21, 2026Updated last week
- 📚 Parameterize, execute, and analyze notebooks☆6,447May 12, 2026Updated 2 weeks ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,494Updated this week
- A library of sklearn compatible categorical variable encoders☆2,491May 5, 2026Updated 3 weeks ago
- Python library for using dplyr like syntax with pandas and SQL☆1,184Sep 24, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆5,145May 20, 2026Updated last week
- Automatic extraction of relevant features from time series:☆9,219Nov 15, 2025Updated 6 months ago
- A game theoretic approach to explain the output of any machine learning model.☆25,470Updated this week
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,804Jul 9, 2024Updated last year
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma…☆8,966Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆658Updated this week
- Voilà turns Jupyter notebooks into standalone web applications☆5,932May 4, 2026Updated 3 weeks ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,399Feb 19, 2025Updated last year
- PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)☆548Oct 20, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Fast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data☆594Sep 10, 2024Updated last year
- Data table backend for dplyr☆675Feb 11, 2026Updated 3 months ago
- R package: future: Unified Parallel and Distributed Processing in R for Everyone☆1,012May 22, 2026Updated last week
- STUMPY is a powerful and scalable Python library for modern time series analysis☆4,096May 15, 2026Updated 2 weeks ago
- Fit interpretable models. Explain blackbox machine learning.☆6,864Updated this week
- Always know what to expect from your data.☆11,525May 21, 2026Updated last week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆20,188May 8, 2026Updated 3 weeks ago