jimdowling / cjsurfLinks
Lahinch surf predictions with Hopsworks
☆15Updated 8 months ago
Alternatives and similar repositories for cjsurf
Users that are interested in cjsurf are comparing it to the libraries listed below
Sorting:
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- PySpark phonetic and string matching algorithms☆41Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 4 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 5 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- ☆95Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- Keep your local python scripts installed and in sync with a databricks notebook. Shortens the feedback loop to develop projects using a h…☆16Updated 7 months ago
- Code that was used as an example during the Data+AI Summit 2020☆15Updated 4 years ago
- MLflow App Library☆77Updated 7 years ago
- Openscoring application for the Docker distributed applications platform☆12Updated 5 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Updated 3 years ago
- This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb☆18Updated 2 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 3 years ago
- Kubeflow example of machine learning/model serving☆37Updated 6 years ago
- ☆42Updated 5 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆78Updated 9 months ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆86Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Workshop for Spark and Databricks☆54Updated 6 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆48Updated last year
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- Demonstration code for MLeap, both Jupyter notebooks and projects☆24Updated 6 years ago
- Utility functions for dbt projects running on Spark☆34Updated last month
- A Table format agnostic data sharing framework☆42Updated 2 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆53Updated last year
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 4 years ago