IntelPython / sdcLinks
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆642Updated 2 years ago
Alternatives and similar repositories for sdc
Users that are interested in sdc are comparing it to the libraries listed below
Sorting:
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆649Updated last week
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.☆933Updated 3 years ago
- Lazydata: Scalable data dependencies for Python projects☆620Updated 6 years ago
- Data Migration for the Blaze Project☆1,004Updated 3 years ago
- A columnar data container that can be compressed.☆958Updated 3 years ago
- Design documents and code for the pandas 2.0 effort.☆306Updated 7 years ago
- Parallel Programming with Python and Charm++☆296Updated 4 months ago
- 🎯 A comprehensive gradient-free optimization framework written in Python☆580Updated 6 years ago
- A library for reading text files over multiple cores.☆1,055Updated 2 years ago
- [ARCHIVED] C GPU DataFrame Library☆139Updated 7 years ago
- Python library for building highly effective data science workflows☆948Updated 2 years ago
- Run IPython notebooks as command-line scripts, generate HTML reports☆452Updated 7 years ago
- Python bindings for ArrayFire: A general purpose GPU library.☆419Updated 2 years ago
- t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark☆404Updated 2 years ago
- Compiled, automatically parallel Python for data science☆490Updated 8 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆271Updated last year
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 7 years ago
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Updated 7 years ago
- Describing statistical models in Python using symbolic formulas☆978Updated last week
- A distributed task scheduler for Dask☆1,656Updated this week
- Scalable Machine Learning with Dask☆942Updated 2 months ago
- A library for defensive data analysis.☆502Updated 5 years ago
- Robust and reusable Executor for joblib☆595Updated 3 months ago
- Language defining a data description protocol☆185Updated 2 years ago
- Repeatable analysis plugin for Jupyter notebook☆261Updated 3 years ago
- Write reproducible reports in Markdown☆439Updated 6 years ago
- Declarative statistical visualization library for Python☆237Updated 7 years ago
- A Python module for parallel optimization of expensive black-box functions☆446Updated last month
- 64bit multithreaded python data analytics tools for numpy arrays and datasets☆387Updated last year