jchacks / data_cacheLinks
Simple in memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store
☆11Updated 5 years ago
Alternatives and similar repositories for data_cache
Users that are interested in data_cache are comparing it to the libraries listed below
Sorting:
- Unified Distributed Execution☆57Updated last year
- Apache Arrow Flight example☆11Updated 5 years ago
- 🎛 Distributed machine learning made simple.☆49Updated 2 years ago
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆33Updated 2 weeks ago
- Distributed XGBoost on Ray☆152Updated last year
- Convenient pyarrow operations following the Pandas API☆45Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆107Updated 2 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere.☆83Updated last year
- zero-code hyperparameters optimization framework☆14Updated last year
- RedisAI showcase☆58Updated last year
- Function dependencies resolution and execution☆71Updated 5 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆105Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- Deploy dask on YARN clusters☆69Updated last year
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- RedisAI integration for MLFlow☆30Updated 2 years ago
- Python client for RedisAI☆89Updated 2 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- Automated Transparent Genetic Feature Engineering☆22Updated 2 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆21Updated 3 years ago
- A place for cython code☆34Updated 3 years ago
- streamlit games☆14Updated 3 years ago
- Talks about vaex☆36Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reports☆254Updated last year
- Streaming API for pandas applied to big datasets☆31Updated last month
- A library to compute histograms on distributed environments, on streaming data☆23Updated 10 months ago
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 3 years ago