h2oai / datatable
A Python package for manipulating 2-dimensional tabular data structures
☆1,818Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for datatable
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,364Updated this week
- Data Analysis Baseline Library☆724Updated 3 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,541Updated 8 months ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,481Updated this week
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,686Updated 4 months ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,742Updated 3 years ago
- Extra blocks for scikit-learn pipelines.☆1,278Updated this week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,293Updated last month
- Scalable Machine Learning with Dask☆902Updated 3 months ago
- Dask tutorial☆1,832Updated last year
- Easy pipelines for pandas DataFrames.☆716Updated 2 weeks ago
- Real-time stream processing for python☆1,244Updated 5 months ago
- Feature engineering package with sklearn like functionality☆1,927Updated last week
- dplyr-style piping operations for pandas dataframes☆889Updated 2 years ago
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews☆1,137Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,013Updated last week
- Prepping tables for machine learning☆1,218Updated this week
- A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.☆1,884Updated 4 months ago
- the portable Python dataframe library☆5,318Updated this week
- Describing statistical models in Python using symbolic formulas☆954Updated this week
- Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.☆2,068Updated 4 months ago
- sqldf for pandas☆1,342Updated 3 months ago
- 📚 Parameterize, execute, and analyze notebooks☆5,977Updated last month
- A Grammar of Graphics for Python☆4,048Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆623Updated last week
- python implementation of the parquet columnar file format.☆787Updated last week
- High-level tools to simplify visualization in Python.☆845Updated this week
- Missing data visualization module for Python.☆3,963Updated 6 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,758Updated 2 years ago
- Python library for building highly effective data science workflows☆952Updated last year