Pandas ExtensionDType/Array backed by Apache Arrow
☆232Feb 22, 2023Updated 3 years ago
Alternatives and similar repositories for fletcher
Users that are interested in fletcher are comparing it to the libraries listed below
Sorting:
- A consistent table management library in python☆160May 15, 2023Updated 2 years ago
- A factory for simplekv-Store-based storage classes.☆24Jan 13, 2024Updated 2 years ago
- ArrayViews: creating specific views to array storage objects☆16Feb 6, 2019Updated 7 years ago
- IP Address dtype and block for pandas☆106Jul 31, 2023Updated 2 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆654Updated this week
- SQL on dataframes - pandas and dask☆64Apr 25, 2018Updated 7 years ago
- Caching based on computation time and storage space☆140Feb 10, 2021Updated 5 years ago
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Apr 9, 2020Updated 5 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Updated this week
- Press conda packages into wheels☆118Jun 18, 2023Updated 2 years ago
- Generate conda environment.yml from PEP 621 and/or flit config.☆11Sep 2, 2021Updated 4 years ago
- [DEVELOPMENT CONTINUES HERE: https://github.com/Pybonacci/jupy2wp] Publish an IPython notebook on a wordpress site using xmlrpc☆17May 20, 2016Updated 9 years ago
- Versatile, high-performance histogram toolkit for Numpy.☆109Nov 6, 2018Updated 7 years ago
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Nov 8, 2018Updated 7 years ago
- general functions for your data .pipe()-lines.☆17Nov 8, 2023Updated 2 years ago
- Notes and experiments in Jupyter dashboarding☆16Apr 17, 2021Updated 4 years ago
- Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.☆24Apr 24, 2019Updated 6 years ago
- Vectorized processing for Apache Arrow☆484Feb 14, 2022Updated 4 years ago
- Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler☆643Nov 9, 2023Updated 2 years ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆528Feb 11, 2026Updated 3 weeks ago
- Streaming and approximate algorithms. WIP, use at own risk.☆27Sep 4, 2025Updated 6 months ago
- Real-time stream processing for python☆1,294Feb 24, 2026Updated last week
- The stupidest database of all time.☆56Feb 2, 2026Updated last month
- the portable Python dataframe library☆6,440Updated this week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆241Oct 13, 2018Updated 7 years ago
- Brushing and linking for big data☆972Dec 2, 2025Updated 3 months ago
- SQLAlchemy dialect for EXASOL☆36Updated this week
- SQLAlchemy for Dremio via the ODBC and Flight interface.☆30Jan 8, 2026Updated last month
- Experimental support for serializing DataFusion plans using substrait☆46Jan 13, 2023Updated 3 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆137Jun 27, 2019Updated 6 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,754Dec 8, 2025Updated 2 months ago
- An experiment in trying to define a core and cleaned-up NumPy API: RNumPy☆13Feb 19, 2021Updated 5 years ago
- ☆10Oct 12, 2023Updated 2 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 5 years ago
- Show effects of over-subscription and ways to fix that☆16Aug 15, 2024Updated last year
- Python binding for DataFusion☆59Jul 22, 2022Updated 3 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,483Updated this week
- ☆23May 2, 2024Updated last year
- Manipulate arrays of complex data structures as easily as Numpy.☆214Feb 8, 2021Updated 5 years ago