weslleylc / Feature-StoreLinks
A containerized approach using Apache Kafka, Spark, Cassandra, Hive, Jupyter, and Docker-compose.
☆14Updated 4 years ago
Alternatives and similar repositories for Feature-Store
Users that are interested in Feature-Store are comparing it to the libraries listed below
Sorting:
- Joblib Apache Spark Backend☆249Updated 10 months ago
- Code review for data in dbt☆493Updated last year
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆53Updated 3 years ago
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'☆121Updated 2 years ago
- Airflow Unit Tests and Integration Tests☆261Updated 3 years ago
- ☆201Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- Asynchronous actions for PySpark☆48Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆90Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 7 years ago
- Great Expectations Airflow operator☆170Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347Updated last year
- Enforce Best Practices for all your Airflow DAGs. ⭐☆108Updated 2 weeks ago
- ☆44Updated 3 years ago
- Deploy MLflow with HTTP basic authentication using Docker☆104Updated 3 weeks ago
- A Helm chart to install Apache Airflow on Kubernetes☆292Updated last week
- Astronomer Core Docker Images☆105Updated last year
- A plugin for Apache Airflow that allows you to edit DAGs in browser☆458Updated 2 weeks ago
- A boilerplate for writing PySpark Jobs☆395Updated 2 years ago
- Pylint plugin for static code analysis on Airflow code☆97Updated 5 years ago
- Visualize dependencies between Airflow DAGs☆49Updated 4 years ago
- Read Delta tables without any Spark☆47Updated last year
- ☆269Updated last year
- Spark on Kubernetes infrastructure Helm charts repo☆203Updated 3 years ago
- A workshop with several modules to help learn Feast, an open-source feature store☆97Updated 2 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- Repo for all my code on the articles I post on medium☆106Updated 3 years ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago