ucbrise / flor
Flow with FlorDB 🌻
☆155 Updated 2 weeks ago
Alternatives and similar repositories for flor:
Users who are interested in flor are comparing it to the libraries listed below.
- RAPIDS GPU-BDB ☆108 Updated last year
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p… ☆92 Updated last year
- Willump Is a Low-Latency Useful Machine learning Platform. ☆44 Updated 2 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. ☆301 Updated 11 months ago
- Distribution transparent Machine Learning experiments on Apache Spark ☆90 Updated last year
- Ray-based Apache Beam runner ☆42 Updated last year
- The Data Linter identifies potential issues (lints) in your ML training data. ☆88 Updated 7 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models. ☆42 Updated last year
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆207 Updated this week
- A Machine Learning System for Data Enrichment. ☆75 Updated 6 years ago
- Coarse-grained lineage and tracing for machine learning pipelines. ☆469 Updated 2 years ago
- Incubating project for xgboost operator ☆77 Updated 3 years ago
- Distributed XGBoost on Ray ☆148 Updated 10 months ago
- Projects developed by Domino's R&D team ☆76 Updated 3 years ago
- A Scalable Auto-ML System ☆53 Updated 2 years ago
- Ibis Substrait Compiler ☆102 Updated this week
- A library that translates Python and NumPy to optimized distributed systems code. ☆132 Updated 2 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible. ☆155 Updated 7 months ago
- A platform for online learning that curtails data latency and saves you cost. ☆47 Updated 3 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf ☆136 Updated 5 years ago
- ☆58 Updated last year
- yogadl, the flexible data layer ☆74 Updated 2 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments ☆45 Updated 7 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 Updated 2 years ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks. ☆127 Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- 📚 Notebook storage and publishing workflows for the masses ☆202 Updated 3 years ago
- XGBoost GPU accelerated on Spark example applications ☆52 Updated 2 years ago
- An open-source, vendor-neutral data context service. ☆159 Updated 7 years ago