Examples for High Performance Spark
☆16Oct 25, 2025Updated 5 months ago
Alternatives and similar repositories for high-performance-spark-examples
Users that are interested in high-performance-spark-examples are comparing it to the libraries listed below.
- ☆15Sep 27, 2017Updated 8 years ago
- Spark and Delta Lake Workshop☆22Jun 14, 2022Updated 3 years ago
- Hadoop Examples☆10Jul 1, 2022Updated 3 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Feb 1, 2018Updated 8 years ago
- A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.☆10Oct 25, 2015Updated 10 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom cod…☆16Mar 17, 2021Updated 5 years ago
- Scrapping made easy...☆15Sep 3, 2016Updated 9 years ago
- Serialization from the C API for R☆13Jan 6, 2026Updated 2 months ago
- ☆18Dec 27, 2025Updated 2 months ago
- rvest test grounds☆10Jan 6, 2016Updated 10 years ago
- ☆35Dec 26, 2025Updated 2 months ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- ☆13Jan 6, 2026Updated 2 months ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Example of how to create your own custom serializers for Kafka queues including JSON, Smile and Kryo☆25Mar 10, 2016Updated 10 years ago
- Run Samza as a Spring Boot application☆18Mar 6, 2017Updated 9 years ago
- Reinforcement Learning Algorithms☆14May 28, 2018Updated 7 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines.☆51Mar 6, 2026Updated 2 weeks ago
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.☆12Mar 13, 2025Updated last year
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- Materials and exercises for SICP☆15Feb 13, 2017Updated 9 years ago
- Spark-cloud is a set of scripts for starting spark clusters on ec2☆12Dec 21, 2015Updated 10 years ago
- Code files for Mastering JBoss Drools 6, published by Packt☆11Sep 12, 2023Updated 2 years ago
- Repository for the dbt Semantic Layer course☆13Updated this week
- Use Azure Monitor to track your Spark jobs in Azure Databricks☆11Jul 16, 2020Updated 5 years ago
- A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight☆14Jun 4, 2020Updated 5 years ago
- Apache Airflow Docker Image.☆16May 3, 2018Updated 7 years ago
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆41Mar 12, 2023Updated 3 years ago
- Apache Zeppelin with support for SQL Server☆16Sep 25, 2017Updated 8 years ago
- The example of using cue with Kubernetes, ArgoCD and Crossplane☆18Sep 9, 2024Updated last year
- R interface to XBRL US API☆22Feb 22, 2018Updated 8 years ago
- Code for the Spark tutorial at the Pydata conference in London June 2015☆12Oct 9, 2016Updated 9 years ago
- Kylie is a blond and small Erlang/Elixir client for Cayley graph data base☆12Feb 15, 2026Updated last month
- The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets☆11Dec 27, 2024Updated last year
- Source code for 'Foundations of Python Network Programming' by Brandon Rhodes and John Goerzen☆13Mar 28, 2017Updated 8 years ago
- R wrapper for the Data Science Toolkit☆26Nov 20, 2017Updated 8 years ago