project-codeflare / rayvens
Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.
☆50Updated 2 years ago
Alternatives and similar repositories for rayvens
Users who are interested in rayvens are comparing it to the libraries listed below.
- Ray-based Apache Beam runner☆41Updated last year
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆233Updated last week
- Unified Distributed Execution☆55Updated 9 months ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- Ray provider for Apache Airflow☆48Updated last year
- Cylon is a fast, scalable, distributed-memory parallel runtime with a Pandas-like DataFrame.☆302Updated last year
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.☆73Updated last year
- Flow with FlorDB 🌻☆154Updated 2 months ago
- Python driver for Timeplus Enterprise or Timeplus Proton☆15Updated 8 months ago
- ☆36Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆343Updated 3 weeks ago
- Python stream processing for analytics☆40Updated last month
- Real-time data processing/feature engineering in Python and Rust. Tailored for modern AI/ML systems.☆62Updated last week
- Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.☆282Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year
- Distributed XGBoost on Ray☆149Updated last year
- Flyte Documentation 📖☆83Updated 4 months ago
- real-time data + ML pipeline☆54Updated last week
- Fybrik☆132Updated last year
- Parquet-based ML data format optimized for working with unstructured data☆140Updated 2 years ago
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆103Updated 2 months ago
- Repository for open inference protocol specification☆59Updated 2 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆24Updated last year
- A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel app…☆343Updated this week
- Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.☆233Updated last year
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- Ibis Substrait Compiler☆104Updated 2 weeks ago
- ☆106Updated 2 years ago