jimdowling / cjsurf
Lahinch surf predictions with Hopsworks
☆15Updated 5 months ago
Alternatives and similar repositories for cjsurf
Users who are interested in cjsurf are comparing it to the repositories listed below.
- PySpark phonetic and string matching algorithms☆39Updated last year
- Read Delta tables without any Spark☆47Updated last year
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 5 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- ☆42Updated 5 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆26Updated 4 years ago
- ☆90Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- ☆40Updated 3 years ago
- JumpSpark - A modern cookiecutter template for PySpark projects with batteries included.☆10Updated 2 years ago
- ☆29Updated 4 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- PDF DataSource for Apache Spark that allows reading PDF files directly into a DataFrame and running OCR on them☆75Updated 6 months ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆32Updated 4 years ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- ☆97Updated 2 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆52Updated 9 months ago
- Capturing model drift and handling its response - Example webinar☆108Updated 6 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆71Updated 5 months ago
- Delta Lake Documentation☆50Updated last year
- A table-format-agnostic data sharing framework☆41Updated last year
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 3 years ago
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆14Updated 4 years ago
- Utility functions for dbt projects running on Spark☆33Updated 8 months ago
- A simple introduction to using Spark ML pipelines☆26Updated 7 years ago
- Keep your local Python scripts installed and in sync with a Databricks notebook. Shortens the feedback loop to develop projects using a h…☆16Updated 4 months ago