IntelPython / sdc
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆643Updated last year
Alternatives and similar repositories for sdc:
Users that are interested in sdc are comparing it to the libraries listed below
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆934Updated 2 years ago
- Data Migration for the Blaze Project☆1,003Updated 2 years ago
- Parallel Programming with Python and Charm++☆293Updated last week
- A columnar data container that can be compressed.☆958Updated 2 years ago
- Scalable Machine Learning with Dask☆916Updated 2 months ago
- A library for defensive data analysis.☆500Updated 5 years ago
- Interactive plotting for Pandas using Vega-Lite☆344Updated 5 years ago
- Lazydata: Scalable data dependencies for Python projects☆624Updated 5 years ago
- Python library for building highly effective data science workflows☆949Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆626Updated this week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 6 years ago
- Describing statistical models in Python using symbolic formulas☆960Updated last week
- Feature engineering and machine learning: together at last!☆24Updated 4 years ago
- Interactive plotting for Python.☆435Updated 4 months ago
- Robust and reusable Executor for joblib☆548Updated 3 months ago
- GraphBLAS for Python☆343Updated last year
- Run IPython notebooks as command-line scripts, generate HTML reports☆450Updated 6 years ago
- [ARCHIVED] C GPU DataFrame Library☆138Updated 6 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 5 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- Tools for test driven data-wrangling and data validation.☆295Updated 3 years ago
- IPython magic command to profile and view your python code as a heat map.☆1,032Updated 6 months ago
- ☆162Updated 3 years ago
- A distributed task scheduler for Dask☆1,595Updated this week
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆877Updated last week
- Benchmark for different operations in pandas against various dataframe sizes.☆965Updated 6 years ago
- Repeatable analysis plugin for Jupyter notebook☆260Updated 2 years ago
- persistent caching to memory, disk, or database☆262Updated 3 weeks ago
- Studio: Simplify and expedite model building process☆381Updated 6 months ago