IntelPython / sdc
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆646Updated 10 months ago
Related projects: ⓘ
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- Scalable Machine Learning with Dask☆892Updated last month
- Data Migration for the Blaze Project☆1,000Updated 2 years ago
- A columnar data container that can be compressed.☆959Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆614Updated this week
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆934Updated last year
- A library for defensive data analysis.☆500Updated 4 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 5 years ago
- A distributed task scheduler for Dask☆1,568Updated this week
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆381Updated last year
- Interactive plotting for Pandas using Vega-Lite☆344Updated 5 years ago
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆861Updated last week
- Describing statistical models in Python using symbolic formulas☆941Updated 3 months ago
- Parallel Programming with Python and Charm++☆289Updated this week
- Extended pickling support for Python objects☆1,633Updated last month
- python implementation of the parquet columnar file format.☆767Updated last week
- Design documents and code for the pandas 2.0 effort.☆306Updated 5 years ago
- Robust and reusable Executor for joblib☆527Updated last month
- Cython implementation of Toolz: High performance functional utilities☆997Updated 2 months ago
- Sparkling Pandas☆363Updated last year
- Python library for building highly effective data science workflows☆951Updated last year
- A library for reading text files over multiple cores.☆1,061Updated last year
- Studio: Simplify and expedite model building process☆379Updated 2 months ago
- Compiled Decision Trees for scikit-learn☆224Updated 4 months ago
- Dataflow programming for python.☆284Updated last year
- Feature engineering and machine learning: together at last!☆23Updated 3 years ago
- IPython magic command to profile and view your python code as a heat map.☆1,026Updated 2 months ago
- 64bit multithreaded python data analytics tools for numpy arrays and datasets☆364Updated 4 months ago
- Tools for test driven data-wrangling and data validation.☆292Updated 2 years ago
- A Python wrapper for the extremely fast Blosc compression library☆350Updated 2 weeks ago