jimdowling / cjsurf
Lahinch surf predictions with Hopsworks
☆15Updated 2 months ago
Alternatives and similar repositories for cjsurf
Users that are interested in cjsurf are comparing it to the libraries listed below
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- ☆86Updated 2 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆25Updated 4 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆55Updated 4 years ago
- A table-format-agnostic data sharing framework☆38Updated last year
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆14Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Writing PySpark logs in Apache Spark and Databricks☆17Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- PDF DataSource for Apache Spark; reads PDF files directly into a DataFrame and runs OCR on them☆72Updated 3 months ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆52Updated 6 months ago
- ☆42Updated 5 years ago
- Delta Lake examples☆227Updated 10 months ago
- ☆91Updated 7 months ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop…☆144Updated 11 months ago
- ☆17Updated 3 years ago
- ☆48Updated this week
- Read Delta tables without any Spark☆47Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- ☆58Updated last year
- Code that was used as an example during the Data+AI Summit 2020☆15Updated 4 years ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆43Updated last month
- New generation opensource data stack☆70Updated 3 years ago
- ☆96Updated 2 years ago