datapunchorg/punch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/datapunchorg/punch)

datapunchorg / punch

This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like Apache Spark.

☆55

Alternatives and similar repositories for punch

Users that are interested in punch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / yunikorn-scheduler-interface
View on GitHub
Apache YuniKorn Scheduler Interface
☆35Updated this week
apple / batch-processing-gateway
View on GitHub
The gateway component to make Spark on K8s much easier for Spark users.
☆221May 6, 2026Updated 2 months ago
palantir / k8s-spark-scheduler
View on GitHub
A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes
☆179Apr 23, 2023Updated 3 years ago
bootcamp-go / bcgo-w11
View on GitHub
☆16Feb 15, 2024Updated 2 years ago
datapunchorg / spark-ui-reverse-proxy
View on GitHub
This project provides a reverse proxy for Spark UI on Kubernetes
☆16Oct 12, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pravega / bookkeeper-operator
View on GitHub
Kubernetes Operator for bookkeeper
☆16Feb 13, 2024Updated 2 years ago
criteo / babar
View on GitHub
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆129Sep 7, 2018Updated 7 years ago
tony612 / kexplain
View on GitHub
Kexplain is an interactive kubectl explain
☆12Oct 23, 2023Updated 2 years ago
renat0sn / QuintoAndar-WebScrapping
View on GitHub
Webscrapping project to scrape rent properties content in São Paulo - SP, using the brazilian famous housing website QuintoAndar.
☆12May 2, 2023Updated 3 years ago
lasersonlab / ndarray.scala
View on GitHub
N-dimensional arrays, with Zarr and HDF5 integrations
☆19Feb 26, 2019Updated 7 years ago
moja-global / Land_Sector_Datasets
View on GitHub
This Repo is bringing together datasets that can be useful for land sector management. The range of datasets is going beyond what is need…
☆13Dec 11, 2023Updated 2 years ago
apache / horaedb-client-rs
View on GitHub
Apache HoraeDB (Incubating) Rust Client.
☆18Jun 13, 2025Updated last year
stettix / chronicles
View on GitHub
Version controlled immutable storage for Big Data
☆11Apr 20, 2021Updated 5 years ago
GreptimeTeam / greptimedb-ingester-erl
View on GitHub
An Erlang ingester for GreptimeDB, which is compatible with GreptimeDB protocol and lightweight.
☆16Apr 17, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hartza-capital / fluent-bit-go-gcs
View on GitHub
Fluent-Bit output plugin for Google Cloud Storage
☆12Jul 13, 2021Updated 5 years ago
blaze-init / spark-blaze-extension
View on GitHub
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
☆11Apr 23, 2022Updated 4 years ago
brooklyn-data / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆10Feb 10, 2023Updated 3 years ago
warmchang / KubeCon-CloudNativeCon-OpenSourceSummit-AI_dev-China-2024
View on GitHub
KubeCon-CloudNativeCon-OpenSourceSummit-AI_dev-China-2024's slides. / 2024中国(香港)CNCF大会PPT。
☆12Aug 31, 2024Updated last year
phenpessoa / yt-utf8
View on GitHub
A simple UTF-8 decoding algorithm used to teach the standard on my YouTube channel
☆10Feb 1, 2024Updated 2 years ago
apache / kyuubi-client
View on GitHub
Client libraries of end users of Apache Kyuubi
☆11May 15, 2026Updated 2 months ago
dmatrix / feast_workshops
View on GitHub
A series of workshop modules introducing Feast feature store.
☆18May 31, 2022Updated 4 years ago
chriso / roaring-bitmap
View on GitHub
Roaring bitmaps in C
☆17Mar 8, 2016Updated 10 years ago
fbusrayaman / face-anti-spoofing
View on GitHub
CDCN++ face anti spoofing model experimented on and added to during tübitak internship
☆10Oct 28, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cosven / db-testing
View on GitHub
scripts for testing TiDB
☆10Feb 4, 2026Updated 5 months ago
alibaba-archive / aliyun-oss-hadoop-fs
View on GitHub
Hadoop filesystem implementation for Aliyun OSS
☆13Feb 14, 2016Updated 10 years ago
cwida / pvldbstyle
View on GitHub
PVLDB LaTeX style, based on acmart
☆16Apr 29, 2021Updated 5 years ago
CypherNova1337 / Auto-IDOR
View on GitHub
An interactive bash script for detecting IDOR vulnerabilities. Automates the discovery of access control issues in web applications, enha…
☆14Apr 10, 2025Updated last year
zjffdu / flink-udf
View on GitHub
☆12Mar 12, 2021Updated 5 years ago
erikerlandson / ray-odh-demo
View on GitHub
Prototype an integration of ray with Open Data Hub, using a singleuser profile to provision a ray cluster
☆14Mar 23, 2022Updated 4 years ago
dnnmedia / dnn_site_demo
View on GitHub
DNN's alpha version running on the Kovan Testnet
☆18May 13, 2018Updated 8 years ago
rgb91 / temporal-deepfake-segmentation
View on GitHub
Transformer Model to detect deepfakes from popular datasets. Predictions made on embeddings (features) generated by a different ViT model…
☆14Nov 27, 2023Updated 2 years ago
microsoft / dstoolkit-genai-shap
View on GitHub
SHAP (SHapley Additive exPlanations) for Generative AI (LLMs and SMLs) based solutions.
☆19Jul 4, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bytedance / CloudShuffleService
View on GitHub
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
☆261May 12, 2024Updated 2 years ago
lagerspetz / TimeSeriesSpark
View on GitHub
Time series and energy data analysis API for Spark.
☆19May 1, 2012Updated 14 years ago
aravinthsci / Spark_Delta_Lake
View on GitHub
Delta Lake Examples
☆11Apr 24, 2020Updated 6 years ago
mahdyne / pyspark-tut
View on GitHub
☆23Nov 26, 2020Updated 5 years ago
xuw10 / kubeflow-tfx-workshop
View on GitHub
Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + Kubernetes + Airflow + Jupyter
☆11Oct 28, 2022Updated 3 years ago
yulrizka / rxscan
View on GitHub
Go library to scan regular expression capture group to variable similar to fmt.Scanf
☆14Dec 15, 2020Updated 5 years ago
ory / works
View on GitHub
This repository shows examples of practical solutions using Ory projects and other OSS
☆10Jul 14, 2022Updated 4 years ago