jimdowling / cjsurf
Lahinch surf predictions with Hopsworks
☆15Updated 2 years ago
Alternatives and similar repositories for cjsurf:
Users that are interested in cjsurf are comparing it to the libraries listed below
- PySpark phonetic and string matching algorithms☆39Updated 11 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 10 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator☆44Updated 10 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Updated last year
- Code snippets for Data Engineering Design Patterns book☆62Updated 3 weeks ago
- Delta Lake helper methods. No Spark dependency.☆22Updated 4 months ago
- Read Delta tables without any Spark☆47Updated 10 months ago
- A write-audit-publish implementation on a data lake without the JVM☆45Updated 5 months ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- ☆12Updated 4 years ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- Sample configuration to deploy a modern data platform.☆87Updated 3 years ago
- ☆17Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated last year
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated 8 months ago
- A Table format agnostic data sharing framework☆38Updated 11 months ago
- ☆84Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆54Updated this week
- A workshop with several modules to help learn Feast, an open-source feature store☆84Updated 3 weeks ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Scaling Python Machine Learning☆45Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Unity Catalog UI☆39Updated 4 months ago
- Example repo to kickstart integration with mlflow pipelines.☆74Updated 2 years ago
- ☆54Updated last year