IntelPython / sdcLinks
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆642Updated last year
Alternatives and similar repositories for sdc
Users that are interested in sdc are comparing it to the libraries listed below
Sorting:
- A columnar data container that can be compressed.☆957Updated 2 years ago
- Data Migration for the Blaze Project☆1,003Updated 3 years ago
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆933Updated 2 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Design documents and code for the pandas 2.0 effort.☆304Updated 6 years ago
- Robust and reusable Executor for joblib☆583Updated last month
- A Python module for parallel optimization of expensive black-box functions☆446Updated last year
- Lazydata: Scalable data dependencies for Python projects☆621Updated 6 years ago
- Parallel Programming with Python and Charm++☆295Updated 2 months ago
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Updated 6 years ago
- IPython magic command to profile and view your python code as a heat map.☆1,033Updated last year
- A consistent table management library in python☆160Updated 2 years ago
- Run IPython notebooks as command-line scripts, generate HTML reports☆452Updated 7 years ago
- Compiled, automatically parallel Python for data science☆490Updated 8 years ago
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆401Updated 2 years ago
- 🎯 A comprehensive gradient-free optimization framework written in Python☆580Updated 6 years ago
- Python bindings for ArrayFire: A general purpose GPU library.☆419Updated 2 years ago
- A distributed task scheduler for Dask☆1,651Updated this week
- Compiled Decision Trees for scikit-learn☆228Updated 5 months ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆239Updated 6 years ago
- SCOOP (Scalable COncurrent Operations in Python)☆653Updated 2 years ago
- Language defining a data description protocol☆185Updated 2 years ago
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting☆946Updated 2 weeks ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆936Updated last year
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 6 years ago
- Real-time stream processing for python☆1,284Updated 10 months ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆154Updated 8 years ago
- [ARCHIVED] C GPU DataFrame Library☆139Updated 6 years ago
- A Python wrapper for the extremely fast Blosc compression library☆358Updated 3 weeks ago
- 💥 Cython memory pool for RAII-style memory management☆458Updated 4 months ago