oap-project / oap-mllibLinks
Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.
☆22Updated last week
Alternatives and similar repositories for oap-mllib
Users that are interested in oap-mllib are comparing it to the libraries listed below
Sorting:
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆355Updated this week
- A tool and library for easily deploying applications on Apache YARN☆145Updated last year
- A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.☆165Updated last month
- Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs☆84Updated last week
- Jupyter kernel for scala and spark☆190Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆953Updated last week
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆71Updated 5 years ago
- Jupyter extensions for SWAN☆58Updated 2 weeks ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 2 years ago
- Joblib Apache Spark Backend☆249Updated 8 months ago
- User tools for Spark RAPIDS☆65Updated last week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆50Updated 3 months ago
- Deploy dask on YARN clusters☆69Updated last year
- The Internals of Delta Lake☆187Updated 3 weeks ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94Updated 7 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆231Updated last week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆130Updated 2 weeks ago
- PMML scoring library for Scala☆66Updated 2 months ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!☆235Updated 11 months ago
- REST API for Apache Spark on K8S or YARN☆109Updated 2 weeks ago
- Point-in-Time optimizations for Apache Spark☆30Updated last year
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 11 months ago
- Train TensorFlow models on YARN in just a few lines of code!☆93Updated 2 years ago
- Apache (Py)Spark type annotations (stub files).☆118Updated 3 years ago
- Friendly ML feature store☆45Updated 3 years ago
- XGBoost GPU accelerated on Spark example applications☆52Updated 3 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Distributed SQL Engine in Python using Dask☆409Updated last year