IntelPython / sdc
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆646Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sdc
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆936Updated 2 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆623Updated last week
- Robust and reusable Executor for joblib☆538Updated 3 weeks ago
- python implementation of the parquet columnar file format.☆787Updated last week
- A columnar data container that can be compressed.☆959Updated 2 years ago
- Real-time stream processing for python☆1,244Updated 5 months ago
- Cython implementation of Toolz: High performance functional utilities☆1,009Updated 2 weeks ago
- A library for defensive data analysis.☆501Updated 4 years ago
- Concurrent data pipelines in Python >>>☆1,549Updated last year
- Data Migration for the Blaze Project☆1,004Updated 2 years ago
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆875Updated 2 months ago
- SCOOP (Scalable COncurrent Operations in Python)☆635Updated last year
- Lazydata: Scalable data dependencies for Python projects☆624Updated 5 years ago
- PySchemes is a library for validating data structures in python☆365Updated 2 years ago
- Scalable Machine Learning with Dask☆902Updated 3 months ago
- Python bindings for ArrayFire: A general purpose GPU library.☆416Updated last year
- A multi-model machine learning feature embedding database☆633Updated 4 years ago
- Parallel Programming with Python and Charm++☆291Updated last week
- Describing statistical models in Python using symbolic formulas☆954Updated this week
- Extended pickling support for Python objects☆1,661Updated last month
- Tools for test driven data-wrangling and data validation.☆294Updated 2 years ago
- persistent caching to memory, disk, or database☆261Updated 2 weeks ago
- Language defining a data description protocol☆183Updated last year
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 6 years ago
- A Python wrapper for the extremely fast Blosc compression library☆353Updated 2 months ago
- Feature engineering and machine learning: together at last!☆23Updated 3 years ago
- 🎯 A comprehensive gradient-free optimization framework written in Python☆576Updated 5 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆923Updated last year
- Python library for building highly effective data science workflows☆952Updated last year