A simple introduction to using Spark ML pipelines
☆26Apr 5, 2018Updated 7 years ago
Alternatives and similar repositories for spark-intro-ml-pipeline-workshop
Users that are interested in spark-intro-ml-pipeline-workshop are comparing it to the libraries listed below
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Nov 9, 2023Updated 2 years ago
- Repository for the dbt Semantic Layer course☆12Updated this week
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated 2 weeks ago
- ☆15Dec 23, 2022Updated 3 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- This is where we store all the fantastic open government data we will provide to GovHackNZ and beyond☆11Jul 28, 2017Updated 8 years ago
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- A bunch of crawlers for extracting data from various sites (site name is mentioned for each one)☆11May 2, 2024Updated last year
- ☆10Jan 5, 2018Updated 8 years ago
- Exploration of spark streaming based on the BigData.be project 2☆15Sep 2, 2013Updated 12 years ago
- Python library for Evaluation☆16Feb 16, 2026Updated 3 weeks ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- This Unity project represents the complete client setup to communicate with a Unity custom GameLift server.☆10Aug 31, 2022Updated 3 years ago
- Dockerfile and artifacts for running a self-contained HDP 2.3 "cluster" in a docker container☆10Aug 30, 2016Updated 9 years ago
- Ansible Role to install a Hadoop Cluster☆10Sep 21, 2020Updated 5 years ago
- ☆10Jul 6, 2018Updated 7 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 3 months ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Material for "PyTorch from Ground Up", a training session at PyCon Nove (Florence, 2018)☆10Apr 21, 2018Updated 7 years ago
- Everyday Analytics and Visualization - JuliaCon 2015☆10Sep 25, 2015Updated 10 years ago
- ☆11Jun 21, 2022Updated 3 years ago
- Build an akaunting Docker image and run with swarm / docker-compose☆10Apr 14, 2020Updated 5 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆11Jun 12, 2018Updated 7 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- Deep learning with TensorFlow and Keras.☆12Jun 18, 2019Updated 6 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Find the posterior decoding of a long sequence of observations.☆17Jul 29, 2010Updated 15 years ago
- Implementation of the query event listener plugin in Java to log Presto statistics on Amazon EMR for auditing and performance insights☆13May 26, 2018Updated 7 years ago
- Data ingestion examples☆11Feb 12, 2015Updated 11 years ago
- New nixnote is cloned on miurahr/nixnote2 ... Nixnote (formerly nevernote) is an incomplete OSS Evernote client. Here is a development branch…☆19Feb 16, 2013Updated 13 years ago
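- Adds tabbed browsing and an associative-notes feature to NixNote. The associative-notes feature computes and presents related notes based on the user's operation history.☆14Jul 20, 2015Updated 10 years ago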
- ☆14Oct 4, 2018Updated 7 years ago
- Serverless ML prediction system to predict electricity demand in NY, USA☆13Feb 3, 2023Updated 3 years ago
- ☆15Dec 11, 2023Updated 2 years ago