jchacks / data_cache
Simple in-memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store.
☆11 Updated 4 years ago
Alternatives and similar repositories for data_cache
Users who are interested in data_cache are comparing it to the libraries listed below.
- Unified Distributed Execution ☆56 Updated 11 months ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ☆19 Updated 6 years ago
- Derivatives models written with the Tributary data flow library ☆24 Updated this week
- real-time data + ML pipeline ☆54 Updated last week
- Convenient pyarrow operations following the Pandas API ☆45 Updated 3 years ago
- Apache Arrow Flight example ☆11 Updated 4 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆37 Updated 4 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning. ☆65 Updated 5 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs ☆20 Updated 4 years ago
- 🎛 Distributed machine learning made simple. ☆49 Updated 2 years ago
- A python package for running directed acyclic graphs of asynchronous I/O operations ☆17 Updated 3 years ago
- A Python package that parses sql and converts it to ibis expressions ☆55 Updated last year
- Deploy dask on YARN clusters ☆69 Updated last year
- Set-oriented Operations in Pandas ☆24 Updated 5 years ago
- Deep Learning how-to's using Lance file format ☆20 Updated 3 months ago
- SQLAlchemy for Dremio via the ODBC and Flight interface. ☆30 Updated 2 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆84 Updated last year
- A friendly fork of the Python Standard Library multiprocessing package which uses dill instead of pickle ☆35 Updated 5 years ago
- ☆162 Updated 4 years ago
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t… ☆25 Updated 4 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆21 Updated 3 years ago
- Function dependencies resolution and execution ☆71 Updated 5 years ago
- Compute set of important operations for HCTSA code ☆26 Updated 5 years ago
- Read Delta tables without any Spark ☆47 Updated last year
- KnowledgeRepo + JupyterLab ☆48 Updated last week
- ByteHub: making feature stores simple ☆61 Updated 4 years ago
- Tools for making Prefect work better for typical data science workflows ☆18 Updated 3 years ago
- Serverless Python with Ray ☆58 Updated 2 years ago
- Python client for RedisAI ☆89 Updated 2 years ago