☆32Mar 21, 2018Updated 8 years ago
Alternatives and similar repositories for scala-spark-application
Users that are interested in scala-spark-application are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆63Sep 6, 2024Updated last year
- A sample implementation of the Spark Datasource API☆24Apr 15, 2017Updated 9 years ago
- Salt Formula to set up and configure Cassandra cluster☆12Aug 11, 2015Updated 10 years ago
- ☆12Oct 24, 2025Updated 7 months ago
- Learn Kubeflow with Arrikto☆15Jan 4, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Essential Spark extensions and helper methods ✨😲☆767Sep 14, 2025Updated 8 months ago
- An Ansible collection of utilities and other resources for Cloudera Platform deployments☆13May 4, 2026Updated 3 weeks ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- ☆12Jun 26, 2023Updated 2 years ago
- Data Exploration Using Spark 2.0☆14Apr 17, 2018Updated 8 years ago
- Encrypts long values (64 bit) and UUIDs (128 bit)☆10Feb 6, 2023Updated 3 years ago
- A NANO node implemented in Typescript/JavaScript☆12Jan 6, 2023Updated 3 years ago
- Hudi Demo Notebook☆11Mar 5, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Strange Loop talk on Specter☆13Sep 26, 2015Updated 10 years ago
- Code that was used as an example during the Data+AI Summit 2020☆15Mar 8, 2021Updated 5 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Nov 29, 2016Updated 9 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Apr 2, 2026Updated last month
- Simplified custom plugins for Trino☆16Jul 29, 2024Updated last year
- Script para importar dataset de "df_gtfs" a PostgreSQL☆13Jun 24, 2013Updated 12 years ago
- A repository used in a NiFi Registry demo☆13Mar 11, 2020Updated 6 years ago
- Scripts about docker and cluster managemant☆13Jul 15, 2019Updated 6 years ago
- Scan QR Codes from video stream.☆15Mar 23, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Template for Spark Projects☆104May 21, 2024Updated 2 years ago
- ☆10Mar 12, 2021Updated 5 years ago
- CMS with a Markdown Editor and comments☆10Sep 9, 2024Updated last year
- ETL jobs for Firefox Telemetry☆29May 7, 2026Updated 3 weeks ago
- ☆13Jan 23, 2023Updated 3 years ago
- Scala-based project to visualize Scala programs in UML class diagrams.☆12Aug 30, 2023Updated 2 years ago
- My journey to learn Scala.☆49Apr 21, 2019Updated 7 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆106Jan 22, 2024Updated 2 years ago
- Access Amazon Elasticsearch through API Gateway and Lambda☆10Dec 7, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Java implementation of MQTT 5.0 compatible broker based on RLib library.☆15May 19, 2026Updated last week
- 2019 PyOhio talk and code sample on spotify/luigi☆11Aug 14, 2023Updated 2 years ago
- Loops in Oozie☆10Feb 15, 2015Updated 11 years ago
- Optimized (fast) Java Keccak/SHA3/SHAKED implementation☆14Oct 12, 2020Updated 5 years ago
- ☆15Updated this week
- Spark based implementation of the Topological Mapper algorithm☆15May 16, 2017Updated 9 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago