jgperrin / net.jgp.books.spark.ch13
Spark in Action, 2nd edition - chapter 13 - Transforming documents
☆14 · Apr 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for net.jgp.books.spark.ch13
Users who are interested in net.jgp.books.spark.ch13 are comparing it to the repositories listed below.
- Spark in Action, 2nd edition - chapter 12 - Transforming your data ☆11 · Feb 6, 2024 · Updated 2 years ago
- Spark in Action, 2nd edition - chapter 4 ☆18 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2e - chapter 10 - Ingestion through structured streaming ☆15 · Jan 4, 2022 · Updated 4 years ago
- Spark in Action, 2nd edition - chapter 16 - exporting data, using delta lake ☆14 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2nd edition - chapter 15 - Aggregating your data ☆12 · Sep 8, 2022 · Updated 3 years ago
- Spark in Action, 2nd edition - chapter 16 - performance, checkpointing, and caching ☆12 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2e - chapter 9 - Advanced ingestion: finding data sources and building your own ☆18 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2nd edition - chapter 7 - Ingestion from files ☆20 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2nd edition - chapter 3 ☆24 · Apr 21, 2023 · Updated 2 years ago
- Spark in Action, 2nd edition - chapter 1 - Introduction ☆107 · Apr 21, 2023 · Updated 2 years ago
- Building custom data sources for Apache Spark, in Java. ☆12 · Oct 12, 2020 · Updated 5 years ago
- Apache Spark examples exclusively in Java ☆103 · Apr 21, 2023 · Updated 2 years ago
- ☆13 · Jul 1, 2025 · Updated 7 months ago
- Base Vagrant file for Hortonworks Data Platform (HDP) instances ☆10 · Feb 2, 2019 · Updated 7 years ago
- Generate Parquet Files ☆13 · Feb 4, 2026 · Updated last week
- Classification anomaly detection in IOT with Machine Learning ☆15 · Mar 30, 2020 · Updated 5 years ago
- Implementation of Decision tree engine from scratch ☆21 · Feb 4, 2014 · Updated 12 years ago
- Cloud Dataproc: Samples and Utils ☆11 · Sep 23, 2020 · Updated 5 years ago
- Simple machine learning in Python/Tensorflow with model saving ☆14 · Jul 27, 2017 · Updated 8 years ago
- Intersectional Fairness (ISF) is a bias detection and mitigation technology for intersectional bias, which combinations of multiple prote… ☆20 · Feb 25, 2025 · Updated 11 months ago
- ☆19 · Aug 23, 2022 · Updated 3 years ago
- Collection of notes about traps & errors in Rust code ☆32 · Jan 31, 2026 · Updated 2 weeks ago
- A Flat Data GitHub Action demo repo ☆14 · Jan 1, 2024 · Updated 2 years ago
- ☆22 · Updated this week
- Minikube for big data with Scala and Spark ☆15 · Oct 28, 2019 · Updated 6 years ago
- Mock streaming data generator ☆17 · May 31, 2024 · Updated last year
- CLI tool for syncing a Databricks folder structure with a local git repo. ☆17 · Jul 30, 2024 · Updated last year
- Data construction exercise 1 ☆21 · Jun 21, 2024 · Updated last year
- Serverless Machine Learning in Action ☆19 · Nov 4, 2021 · Updated 4 years ago
- Spinix 🌀 is a Go package that provides terminal-based highly customizable and performance loading animations, including spinners and pro… ☆21 · Nov 25, 2024 · Updated last year
- Analyzing FEMA's National Flood Insurance Program (NFIP) Data With DuckDB. ☆25 · May 27, 2025 · Updated 8 months ago
- [CSV] Information about Coronavirus disease 2019 (COVID-19) in France ☆18 · Jul 4, 2020 · Updated 5 years ago
- Replicates any database (CDC events) to Bigquery in real time ☆23 · Dec 2, 2025 · Updated 2 months ago
- Deep Learning Pipelines for Apache Spark ☆18 · Jun 22, 2017 · Updated 8 years ago
- ☆17 · Sep 19, 2023 · Updated 2 years ago
- ☆24 · Sep 15, 2025 · Updated 5 months ago
- ☆75 · Updated this week
- GCP Terraform example for use in production ☆30 · Dec 31, 2023 · Updated 2 years ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery ☆45 · Aug 31, 2023 · Updated 2 years ago