IntelPython / sdc
Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
☆643 · Updated last year
Alternatives and similar repositories for sdc:
Users who are interested in sdc are comparing it to the libraries listed below.
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 · Updated 2 years ago
- A columnar data container that can be compressed. ☆957 · Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆627 · Updated this week
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size. ☆935 · Updated 2 years ago
- Lazydata: Scalable data dependencies for Python projects ☆623 · Updated 6 years ago
- A distributed task scheduler for Dask ☆1,606 · Updated this week
- Data Migration for the Blaze Project ☆1,004 · Updated 2 years ago
- A library for defensive data analysis. ☆501 · Updated 5 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml ☆240 · Updated 6 years ago
- Airspeed Velocity: A simple Python benchmarking tool with web-based reporting ☆890 · Updated last week
- Python library for building highly effective data science workflows ☆948 · Updated last year
- Scalable Machine Learning with Dask ☆922 · Updated last month
- Write reproducible reports in Markdown ☆440 · Updated 6 years ago
- PySchemes is a library for validating data structures in python ☆366 · Updated 2 years ago
- Robust and reusable Executor for joblib ☆553 · Updated this week
- Python exposure of dynd ☆120 · Updated 2 years ago
- Cython implementation of Toolz: High performance functional utilities ☆1,029 · Updated 2 months ago
- Design documents and code for the pandas 2.0 effort. ☆303 · Updated 6 years ago
- Dataflow programming for python. ☆289 · Updated last year
- A library for reading text files over multiple cores. ☆1,055 · Updated last year
- Benchmark for different operations in pandas against various dataframe sizes. ☆965 · Updated 6 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.** ☆152 · Updated 8 years ago
- Caching based on computation time and storage space ☆135 · Updated 4 years ago
- Python bindings for ArrayFire: A general purpose GPU library. ☆419 · Updated last year
- Real-time stream processing for python ☆1,257 · Updated 3 months ago
- Parallel Programming with Python and Charm++ ☆294 · Updated last week
- Tools for test driven data-wrangling and data validation. ☆294 · Updated 3 years ago
- A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and mod… ☆335 · Updated 2 months ago
- Feature engineering and machine learning: together at last! ☆24 · Updated 4 years ago
- Studio: Simplify and expedite model building process ☆381 · Updated 8 months ago