jchacks / data_cacheLinks
Simple in memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store
☆11Updated 5 years ago
Alternatives and similar repositories for data_cache
Users that are interested in data_cache are comparing it to the libraries listed below
Sorting:
- Unified Distributed Execution☆57Updated last year
- Derivatives models written with the Tributary data flow library☆24Updated last month
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆33Updated last month
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- Apache Arrow Flight example☆11Updated 5 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- Convenient pyarrow operations following the Pandas API☆45Updated 4 years ago
- Python binding for Khiva library.☆46Updated last year
- Python driver for Timeplus Enterprise or Timeplus Proton☆17Updated last year
- Ray provider for Apache Airflow☆47Updated 2 years ago
- Compute set of important operations for HCTSA code☆27Updated 5 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- Talks about vaex☆36Updated 3 years ago
- 🎛 Distributed machine learning made simple.☆49Updated 2 years ago
- Quickly move data from postgres to numpy or pandas.☆65Updated 2 years ago
- Function dependencies resolution and execution☆71Updated 5 years ago
- Deep Learning how-to's using Lance file format☆22Updated 7 months ago
- Fybrik platform - Arrow/Flight module☆15Updated last year
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆58Updated 4 years ago
- Create MSI installers for PyXLL add-ins☆22Updated 4 years ago
- A python package for running directed acyclic graphs of asynchronous I/O operations☆17Updated 4 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆22Updated 3 years ago
- real-time data + ML pipeline☆53Updated this week
- Python client for RedisAI☆89Updated 2 years ago
- Distributed XGBoost on Ray☆152Updated last year
- Cross Thread Message Pipe☆18Updated 6 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 3 years ago
- KnowledgeRepo + JupyterLab☆48Updated 2 weeks ago