AbsaOSS/spark-hofs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AbsaOSS/spark-hofs)

AbsaOSS / spark-hofs

Scala API for Apache Spark SQL high-order functions

☆15

Alternatives and similar repositories for spark-hofs

Users that are interested in spark-hofs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fqaiser94 / mse
View on GitHub
Make Structs Easy (MSE)
☆18Jun 22, 2020Updated 6 years ago
AbsaOSS / atum
View on GitHub
A dynamic data completeness and accuracy library at enterprise scale for Apache Spark
☆30May 13, 2026Updated 2 months ago
AbsaOSS / hyperdrive
View on GitHub
Extensible streaming ingestion pipeline on top of Apache Spark
☆47Jul 17, 2025Updated last year
AbsaOSS / enceladus
View on GitHub
Dynamic Conformance Engine
☆33Mar 26, 2026Updated 3 months ago
AbsaOSS / pramen
View on GitHub
Resilient data pipeline framework running on Apache Spark
☆31Jul 15, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
swoop-inc / spark-records
View on GitHub
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
☆73Mar 14, 2021Updated 5 years ago
bluecolor / octopus
View on GitHub
Open source task scheduler with dependency management
☆15Jul 1, 2018Updated 8 years ago
hammerlab / spark-tests
View on GitHub
Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
indix / sparkplug
View on GitHub
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
☆28May 15, 2020Updated 6 years ago
qubole / streaminglens
View on GitHub
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17Jan 21, 2020Updated 6 years ago
AbsaOSS / spark-hats
View on GitHub
Nested array transformation helper extensions for Apache Spark
☆38Aug 4, 2023Updated 2 years ago
sally / store-spotter
View on GitHub
Command-line tool to find the nearest retail store
☆10Jan 18, 2017Updated 9 years ago
getporter / examples
View on GitHub
Example Porter bundles
☆14Oct 13, 2025Updated 9 months ago
zhukovgreen / friendly-sequences
View on GitHub
Friendly, Scala like, Sequence interface
☆13Jan 13, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AbsaOSS / ABRiS
View on GitHub
Avro SerDe for Apache Spark structured APIs.
☆242Jun 10, 2025Updated last year
carletes / bazel-python-monorepo
View on GitHub
A sample monorepo of several Python libraries and commands, using Bazel as build system
☆13Oct 11, 2017Updated 8 years ago
mayur2810 / sope
View on GitHub
Apache Spark ETL Utilities
☆40Oct 23, 2024Updated last year
CoxAutomotiveDataSolutions / waimak
View on GitHub
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
☆76Apr 24, 2024Updated 2 years ago
RobinDBL / Linux-TUI-Management
View on GitHub
A script to automate and simplify simple system tasks, such as service control, package control, system monitoring, pinging etc. This scr…
☆10Nov 27, 2022Updated 3 years ago
grails / grails-spring-security-ldap
View on GitHub
☆14Feb 28, 2025Updated last year
dhinojosa / oreilly_escalate_scala_3
View on GitHub
OReilly's Escalate with Scala 3 Material
☆13Jun 1, 2021Updated 5 years ago
mrlesmithjr / developers-workstation-setup
View on GitHub
☆18Mar 28, 2026Updated 3 months ago
alexpdp7 / ansible-create-proxmox-centos7-ipa
View on GitHub
An Ansible role to provision CentOS 7 LXC containers on Proxmox integrated with FreeIPA
☆12Oct 12, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
pfisterer / edsc-microk8s-playbook
View on GitHub
Deploy microk8s on OpenStack with MetalLB
☆12Sep 28, 2022Updated 3 years ago
tanthml / spark_bazel
View on GitHub
Spark Application with Bazel
☆16Apr 28, 2018Updated 8 years ago
amient / affinity
View on GitHub
Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka
☆25Oct 16, 2020Updated 5 years ago
Twibot-ai / pony_synth_script
View on GitHub
Executable script for pony voice synthesis project
☆11Jun 21, 2022Updated 4 years ago
WeAreWizards / blog
View on GitHub
We Are Wizards Blog
☆19Oct 31, 2016Updated 9 years ago
acqio / rules_databricks
View on GitHub
This repository contains rules for interacting with Databricks.
☆10Feb 17, 2021Updated 5 years ago
datasphere-oss / datasphere-service
View on GitHub
an open source dataworks platform
☆20Jun 4, 2021Updated 5 years ago
glami / cortex-serving-client
View on GitHub
Cortex.dev ML Serving Client for Python with garbage API collection.
☆15Apr 26, 2023Updated 3 years ago
XANi / uberstatus
View on GitHub
i3 status line generator
☆12Apr 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cfpb / aurora
View on GitHub
An open source enterprise data warehousing and analysis platform.
☆22Nov 8, 2021Updated 4 years ago
davidderus / ansible-rpi
View on GitHub
Make Raspberry Pi up and running in a few command
☆19Apr 22, 2018Updated 8 years ago
apache / daffodil
View on GitHub
Apache Daffodil
☆112Updated this week
AbsaOSS / generate-release-notes
View on GitHub
Efficiently automate your release note generation with 'generate-release-notes'. This GH action scans your target GitHub repository's iss…
☆13Updated this week
cloudera-labs / envelope
View on GitHub
Build configuration-driven ETL pipelines on Apache Spark
☆162Oct 4, 2022Updated 3 years ago
jeoffreylim / maelstrom
View on GitHub
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …
☆21Feb 6, 2017Updated 9 years ago
kudu-book / getting-started-kudu
View on GitHub
☆11Jun 29, 2018Updated 8 years ago