CODAIT / aardpfark
A library for exporting Spark ML models and pipelines to PFA
☆54 Updated 6 years ago
Alternatives and similar repositories for aardpfark:
Users who are interested in aardpfark are comparing it to the libraries listed below:
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Implementations of the Portable Format for Analytics (PFA) ☆129 Updated 2 years ago
- Spark ML Lib serving library ☆48 Updated 6 years ago
- PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/) ☆94 Updated 2 years ago
- Library for organizing batch processing pipelines in Apache Spark ☆41 Updated 8 years ago
- MLeap allows for easily putting Spark ML pipelines into production ☆78 Updated 8 years ago
- Feature selection methods as Spark MLlib Pipelines ☆30 Updated 6 years ago
- functionstest ☆33 Updated 8 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration. ☆70 Updated last year
- Cheatsheet for Spark DataFrame ☆91 Updated 5 years ago
- Spark Parameter Optimization and Tuning ☆31 Updated 6 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries. ☆65 Updated 7 years ago
- something to help you spark ☆65 Updated 6 years ago
- type-class based data cleansing library for Apache Spark SQL ☆79 Updated 5 years ago
- Routines and data structures for using isarn-sketches idiomatically in Apache Spark ☆29 Updated 8 months ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 Updated 7 years ago
- This project provides sequential pattern mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and co… ☆30 Updated 9 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark ☆147 Updated 9 years ago
- Spark package for checking data quality ☆221 Updated 4 years ago
- Using JPMML Evaluator to validate the PMML models exported from Spark ☆19 Updated 7 years ago
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- Support Highcharts in Apache Zeppelin ☆81 Updated 7 years ago
- Splittable Gzip codec for Hadoop ☆70 Updated 3 weeks ago
- ☆106 Updated 2 years ago
- Building Annoy Index on Apache Spark ☆72 Updated 4 years ago
- Affinity Propagation on Spark ☆19 Updated 3 years ago
- Factorization Machines on Spark and Glint ☆25 Updated 8 years ago
- ☆110 Updated 7 years ago
- Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets ☆54 Updated last year
- A tool and library for easily deploying applications on Apache YARN ☆142 Updated 11 months ago