IntelPython / sdc
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆643Updated last year
Alternatives and similar repositories for sdc:
Users that are interested in sdc are comparing it to the libraries listed below
- Scalable Machine Learning with Dask☆927Updated 2 months ago
- A distributed task scheduler for Dask☆1,624Updated this week
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆898Updated last month
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated 2 years ago
- Parallel Programming with Python and Charm++☆294Updated this week
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆629Updated last week
- Data Migration for the Blaze Project☆1,004Updated 2 years ago
- Cython implementation of Toolz: High performance functional utilities☆1,036Updated 3 months ago
- A library for defensive data analysis.☆500Updated 5 years ago
- Lazydata: Scalable data dependencies for Python projects☆623Updated 6 years ago
- Quilt is a data mesh for connecting people with actionable data☆1,333Updated this week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 6 years ago
- Tools for test driven data-wrangling and data validation.☆294Updated 3 years ago
- A columnar data container that can be compressed.☆956Updated 2 years ago
- Robust and reusable Executor for joblib☆558Updated 3 weeks ago
- A Python package to manage extremely large amounts of data☆1,327Updated last week
- A lightweight Traits like module☆637Updated 2 months ago
- GraphBLAS for Python☆344Updated last year
- Run IPython notebooks as command-line scripts, generate HTML reports☆450Updated 6 years ago
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,037Updated last month
- Fast NumPy array functions written in C☆1,106Updated last week
- Interactive plotting for Pandas using Vega-Lite☆344Updated 6 years ago
- Python library for building highly effective data science workflows☆950Updated last year
- Extended pickling support for Python objects☆1,742Updated 2 weeks ago
- Real-time stream processing for python☆1,257Updated 4 months ago
- Design documents and code for the pandas 2.0 effort.☆303Updated 6 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 5 years ago
- Easy pipelines for pandas DataFrames.☆719Updated 5 months ago
- python implementation of the parquet columnar file format.☆821Updated 3 weeks ago
- 64bit multithreaded python data analytics tools for numpy arrays and datasets☆382Updated 11 months ago