Scala API for Apache Spark SQL high-order functions
☆14Aug 4, 2023Updated 2 years ago
Alternatives and similar repositories for spark-hofs
Users that are interested in spark-hofs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make Structs Easy (MSE)☆18Jun 22, 2020Updated 5 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30Apr 15, 2026Updated 2 weeks ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆46Jul 17, 2025Updated 9 months ago
- Dynamic Conformance Engine☆32Mar 26, 2026Updated last month
- Resilient data pipeline framework running on Apache Spark☆28Apr 22, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- ☆11May 16, 2022Updated 3 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Nested array transformation helper extensions for Apache Spark☆37Aug 4, 2023Updated 2 years ago
- Example Porter bundles☆14Oct 13, 2025Updated 6 months ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Avro SerDe for Apache Spark structured APIs.☆243Jun 10, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Friendly, Scala like, Sequence interface☆12Jan 13, 2026Updated 3 months ago
- A sample monorepo of several Python libraries and commands, using Bazel as build system☆13Oct 11, 2017Updated 8 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆76Apr 24, 2024Updated 2 years ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- ☆17Mar 28, 2026Updated last month
- A script to automate and simplify simple system tasks, such as service control, package control, system monitoring, pinging etc. This scr…☆10Nov 27, 2022Updated 3 years ago
- ☆14Feb 28, 2025Updated last year
- Command-line tool to find the nearest retail store☆10Jan 18, 2017Updated 9 years ago
- Deploy microk8s on OpenStack with MetalLB☆12Sep 28, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An Ansible role to provision CentOS 7 LXC containers on Proxmox integrated with FreeIPA☆12Oct 12, 2023Updated 2 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- We Are Wizards Blog☆19Oct 31, 2016Updated 9 years ago
- Executable script for pony voice synthesis project☆11Jun 21, 2022Updated 3 years ago
- an open source dataworks platform☆21Jun 4, 2021Updated 4 years ago
- Cortex.dev ML Serving Client for Python with garbage API collection.☆15Apr 26, 2023Updated 3 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- OReilly's Escalate with Scala 3 Material☆13Jun 1, 2021Updated 4 years ago
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- i3 status line generator☆12Apr 11, 2025Updated last year
- Make Raspberry Pi up and running in a few command☆19Apr 22, 2018Updated 8 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆162Oct 4, 2022Updated 3 years ago
- Efficiently automate your release note generation with 'generate-release-notes'. This GH action scans your target GitHub repository's iss…☆13Updated this week
- Apache Daffodil☆109Updated this week
- ☆11Jun 29, 2018Updated 7 years ago
- R COBOL DI (Data Integration) Package : Import COBOL CopyBook data files directly into R as properly structured data frames.☆15Aug 7, 2024Updated last year