ucbrise / flor
Flow with FlorDB 🌻
☆154Updated last month
Alternatives and similar repositories for flor:
Users that are interested in flor are comparing it to the libraries listed below
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year
- Ray-based Apache Beam runner☆43Updated last year
- A Scalable Auto-ML System☆53Updated 2 years ago
- Coarse-grained lineage and tracing for machine learning pipelines.☆467Updated 2 years ago
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆91Updated 11 months ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated last year
- Willump Is a Low-Latency Useful Machine learning Platform.☆44Updated last year
- The Data Linter identifies potential issues (lints) in your ML training data.☆87Updated 7 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆299Updated 9 months ago
- Distributed XGBoost on Ray☆147Updated 8 months ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.☆127Updated 4 years ago
- A platform for online learning that curtails data latency and saves you cost.☆47Updated 3 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆198Updated this week
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated 2 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 5 years ago
- An open-source, vendor-neutral data context service.☆159Updated 7 years ago
- Ibis Substrait Compiler☆100Updated this week
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆42Updated 4 years ago
- A Machine Learning System for Data Enrichment.☆75Updated 6 years ago
- ☆162Updated 3 years ago
- Incubating project for xgboost operator☆76Updated 3 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 7 years ago
- A tool and library for easily deploying applications on Apache YARN☆143Updated last year
- Source code for the split annotations project.☆53Updated 2 years ago
- yogadl, the flexible data layer☆74Updated last year
- ☆55Updated last year
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago