Adyen / feast-spark-offline-store
This repo contains a plugin for feast to run an offline store on Spark
☆14Updated 2 years ago
Alternatives and similar repositories for feast-spark-offline-store:
Users that are interested in feast-spark-offline-store are comparing it to the libraries listed below
- Delta Lake examples☆221Updated 5 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 8 months ago
- The gateway component to make Spark on K8s much easier for Spark users.☆187Updated 2 months ago
- ☆55Updated last year
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆50Updated 2 weeks ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆43Updated 8 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆180Updated 2 weeks ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆67Updated 11 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- End to End example integrating MLFlow and Seldon Core☆51Updated 4 years ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆212Updated last week
- Official Dockerfile for Apache Spark☆128Updated last month
- ☆16Updated 8 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated 3 weeks ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆120Updated this week
- Azure plugins for Feast (FEAture STore)☆82Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆54Updated last week
- Unity Catalog UI☆40Updated 6 months ago
- Dataproc templates and pipelines for solving simple in-cloud data tasks☆126Updated this week
- ☆30Updated 2 years ago
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆54Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dataproc☆48Updated last year
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 2 years ago
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆53Updated 2 years ago
- A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.☆147Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆76Updated last month
- Apache Spark Kubernetes Operator☆106Updated last week
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆172Updated 7 months ago