findify / featury
Friendly ML feature store
☆45Updated 2 years ago
Alternatives and similar repositories for featury:
Users that are interested in featury are comparing it to the libraries listed below
- The Internals of PySpark☆25Updated 2 weeks ago
- Data Sketches for Apache Spark☆22Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- Point-in-Time optimizations for Apache Spark☆29Updated last year
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Spark ML Lib serving library☆48Updated 6 years ago
- ☆47Updated 5 months ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 3 years ago
- Magic to help Spark pipelines upgrade☆34Updated 3 months ago
- ☆63Updated 5 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- Kubeflow example of machine learning/model serving☆35Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 2 weeks ago
- Spark Structured Streaming State Tools☆34Updated 4 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year
- Code and presentation for Strata Model Serving tutorial☆68Updated 5 years ago
- Flowchart for debugging Spark applications☆104Updated 3 months ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆35Updated last month
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆11Updated 4 years ago
- Filling in the Spark function gaps across APIs☆50Updated 3 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆46Updated last month
- Dione - a Spark and HDFS indexing library☆50Updated 9 months ago
- ☆31Updated 5 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- A Spark datasource for the HadoopOffice library☆39Updated 2 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Updated last year