DIYBigData/pyspark-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DIYBigData/pyspark-benchmark)

DIYBigData / pyspark-benchmark

A lightweight benchmark utility for PySpark

☆20

Alternatives and similar repositories for pyspark-benchmark

Users that are interested in pyspark-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DIYBigData / spark-data-analysis-projects
View on GitHub
A collection of data analysis projects done using PySpark via Jupyter notebooks.
☆10Oct 8, 2022Updated 3 years ago
aws-samples / emr-spark-benchmark
View on GitHub
☆26Apr 26, 2026Updated 2 months ago
JerryLead / SparkProfiler
View on GitHub
Profiling Spark Applications for Performance Comparison and Diagnosis
☆16Nov 11, 2018Updated 7 years ago
agelbess / k8scms
View on GitHub
K8s ready headless CMS featuring Mongo and Quarkus
☆12Oct 2, 2021Updated 4 years ago
dnguyenngoc / real-time-analytic
View on GitHub
This repo gives an introduction to setting up streaming analytics using open source technologies
☆25Mar 2, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dfreelon / geostring
View on GitHub
From free-form text to standardized geographical info.
☆11Jul 3, 2019Updated 7 years ago
cheukhin1024 / Financial-Data-Project-in-Azure
View on GitHub
Free High-Quality Financial Data in Azure
☆13Jun 15, 2024Updated 2 years ago
protea-earth / greta
View on GitHub
👧 Greta is an agile voice assistant to help reduce your carbon footprint.
☆13Apr 24, 2023Updated 3 years ago
larecipe / larecipe-feedback
View on GitHub
Get feedback from your users about your documentations.
☆19Sep 19, 2022Updated 3 years ago
jimit105 / Intro-to-Deep-Learning-with-PyTorch
View on GitHub
☆11Feb 29, 2020Updated 6 years ago
Rjerk / alpine-pkg-glibc
View on GitHub
A glibc compatibility layer package for Alpine Linux (arm64)
☆11Jan 10, 2020Updated 6 years ago
Kavit900 / data-streaming-kafka-flink-postgres
View on GitHub
☆35Nov 25, 2023Updated 2 years ago
XpressAI / technologic
View on GitHub
Technologic is a user-friendly AI Chatbot Client packed with features to enhance your chatting experience. Securely store conversations, …
☆18Feb 1, 2025Updated last year
kishlayjeet / Stock-Market-Real-Time-Data-Pipeline-with-Apache-Kafka-and-Cassandra
View on GitHub
A end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra Data is processed on AWS EC2 with Apa…
☆29Jun 7, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / eurosys24-artifacts
View on GitHub
Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"
☆31Feb 21, 2024Updated 2 years ago
injcristianrojas / UTCClock
View on GitHub
GNOME Shell extension for showing UTC time on the top bar
☆16Mar 15, 2026Updated 4 months ago
prideout / camera_demo
View on GitHub
demo for par_camera_control.h
☆11Nov 22, 2022Updated 3 years ago
selvam85 / Cat-Dog-Classifier
View on GitHub
Cat Dog Classifier
☆16Dec 23, 2018Updated 7 years ago
HPI-Information-Systems / Pollock
View on GitHub
Pollock is a benchmark for data loading on character-delimited files.
☆28Jul 16, 2026Updated last week
sinhaapurva25 / interview-experiences
View on GitHub
Past interview questions.
☆12Feb 19, 2026Updated 5 months ago
SX-Aurora / veda
View on GitHub
VEDA (VE Driver API)
☆22Mar 17, 2026Updated 4 months ago
technosaurus / jsish
View on GitHub
mirror of jsish - A javascript interpreter with 0install, sqlite and websocket support
☆13Dec 8, 2015Updated 10 years ago
Ruddle / rustfield
View on GitHub
Flow field pathfinding in rust
☆13Sep 29, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
UIACC / codeforces-cli
View on GitHub
A command-line tool for codeforces website
☆11Dec 8, 2022Updated 3 years ago
SVizor42 / DE_Zoomcamp
View on GitHub
Solutions for Data Engineering Zoomcamp, Winter 2022.
☆16Apr 22, 2022Updated 4 years ago
five-embeddev / build-and-verify
View on GitHub
Docker Containers and for building and verifiying RISC-V firmware
☆11Aug 16, 2025Updated 11 months ago
dair-ai / data_science_writing_primer
View on GitHub
Writing Primer for Data Scientists
☆18Feb 19, 2020Updated 6 years ago
Apress / beginning-apache-spark-3
View on GitHub
Source Code for 'Beginning Apache Spark 3' by Hien Luu
☆13Oct 14, 2021Updated 4 years ago
jsnell / zlib-bench
View on GitHub
Benchmark script for comparing different versions of zlib
☆15Jul 18, 2017Updated 9 years ago
sdaschner / favorite-coffee
View on GitHub
Coffee recommendation with Neo4j & Quarkus
☆20Mar 26, 2025Updated last year
sparcians / stf_spec
View on GitHub
An open-source Simulation Trace Format specification
☆17Jun 4, 2026Updated last month
XpressAI / xaibo
View on GitHub
Xaibo is a modular agent framework designed for building flexible AI systems with clean protocol-based interfaces.
☆15Jul 16, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vchlum / wireless-hid
View on GitHub
This Gnome shell extension shows the battery of the wireless keyboards and mice in percentages and colors. Multiple devices are supported…
☆25Mar 6, 2026Updated 4 months ago
mgerdes / terrain-generation
View on GitHub
Some terrain generation using a low poly style
☆11Dec 16, 2016Updated 9 years ago
CertifaiAI / time-series-labs
View on GitHub
Hands-on training labs for Deep Learning in Time Series course by CERTIFAI
☆18Aug 8, 2021Updated 4 years ago
iamsomraj / Adapt-Solutions
View on GitHub
This is a repository, containing the solutions of Adapt. I have made this repo for only educational purpose of mine.
☆27Apr 2, 2021Updated 5 years ago
PacktPublishing / Elm-Web-Development
View on GitHub
Elm Web Development, published by Packt
☆13Oct 31, 2022Updated 3 years ago
kimmolinna / duckdb-zig-build
View on GitHub
DuckDB is an in-process SQL OLAP Database Management System
☆15Jun 7, 2026Updated last month
karajan9 / statisticalrethinking
View on GitHub
Working through Statistical Rethinking by Richard McElreath
☆10Sep 1, 2020Updated 5 years ago