jimdowling / cjsurfLinks
Lahinch surf predictions with Hopsworks
☆15Updated 6 months ago
Alternatives and similar repositories for cjsurf
Users that are interested in cjsurf are comparing it to the libraries listed below
Sorting:
- A Table format agnostic data sharing framework☆42Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- ☆92Updated 2 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Read Delta tables without any Spark☆47Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- ☆97Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- Utility functions for dbt projects running on Spark☆33Updated 3 weeks ago
- Projects developed by Domino's R&D team☆77Updated 3 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 5 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆52Updated 10 months ago
- New generation opensource data stack☆75Updated 3 years ago
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆14Updated 4 years ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆44Updated last month
- Big Data Demystified meetup and blog examples☆31Updated last year
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- Delta Lake examples☆233Updated last year
- Unity Catalog UI☆43Updated last year
- ☆42Updated 5 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- ☆58Updated last year
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆32Updated 4 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆76Updated 7 months ago