logicalclocks / feature-store-api
Python - Java/Scala API for the Hopsworks feature store
☆53Updated last week
Related projects: ⓘ
- Point-in-Time optimizations for Apache Spark☆29Updated 8 months ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated last year
- Open Benchmarks for Evaluating the Performance of Feature Stores☆34Updated 6 months ago
- Grouped time series forecasting engine☆36Updated last year
- HopsWorks - Hadoop for Humans☆116Updated 5 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆89Updated 6 months ago
- XGBoost GPU accelerated on Spark example applications☆51Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 6 months ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Updated 3 months ago
- Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…☆191Updated 3 years ago
- Accelerator to rapidly deploy customized features for your business☆55Updated 9 months ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t…☆147Updated this week
- Extensible Rules Engine for custom Dataframe / Dataset validation☆134Updated 4 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆193Updated this week
- Flowchart for debugging Spark applications☆100Updated last week
- real-time data + ML pipeline☆54Updated this week
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- Deploy dask on YARN clusters☆69Updated last month
- ☆54Updated 8 months ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆82Updated 5 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆53Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 3 years ago
- Instant access to the Spark cluster from anywhere☆16Updated 3 years ago
- The Internals of Delta Lake☆180Updated last month
- ☆104Updated last year
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.☆118Updated this week
- type-class based data cleansing library for Apache Spark SQL☆79Updated 5 years ago
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆50Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆177Updated 9 months ago