jchacks / data_cacheLinks
Simple in memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store
☆11Updated 4 years ago
Alternatives and similar repositories for data_cache
Users that are interested in data_cache are comparing it to the libraries listed below
Sorting:
- Convenient pyarrow operations following the Pandas API☆45Updated 3 years ago
- Unified Distributed Execution☆56Updated 10 months ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere.☆83Updated 7 months ago
- Derivatives models written with the Tributary data flow library☆23Updated last week
- 🎛 Distributed machine learning made simple.☆49Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- Deep Learning how-to's using Lance file format☆20Updated 2 months ago
- LightGBM on Ray☆50Updated last year
- Quickly move data from postgres to numpy or pandas.☆65Updated 2 years ago
- Function dependencies resolution and execution☆71Updated 5 years ago
- real-time data + ML pipeline☆54Updated last week
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆32Updated last week
- Bidirectional communication for the HoloViz ecosystem☆34Updated last month
- Streaming API for pandas applied to big datasets☆31Updated 11 months ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Scalable pattern search optimization with dask☆22Updated 8 years ago
- Apache Arrow Flight example☆11Updated 4 years ago
- Compute set of important operations for HCTSA code☆26Updated 5 years ago
- Python client for RedisAI☆89Updated 2 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- Automation tools for Python benchmarking☆19Updated 6 years ago
- ☆46Updated last year
- ☆30Updated 4 years ago
- Cell-by-cell testing for production Jupyter notebooks in JupyterLab☆95Updated last week
- Ray provider for Apache Airflow☆48Updated last year
- Materials for the SciPy 2019 RAPIDS tutorial☆22Updated 6 years ago
- ☆92Updated 5 years ago