maropu/datasketches-spark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maropu/datasketches-spark)

maropu / datasketches-spark

Data Sketches for Apache Spark

☆22

Alternatives and similar repositories for datasketches-spark

Users that are interested in datasketches-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

techsuppdiva / spark-cheat-sheets
View on GitHub
This repo stores my Spark Tutorial slides.
☆15Feb 8, 2016Updated 10 years ago
amundsen-io / amundsengremlin
View on GitHub
Amundsen Gremlin
☆22Aug 26, 2022Updated 3 years ago
smarx / wazproxy
View on GitHub
Wazproxy is an HTTP proxy written in Node.js that automatically signs requests to Windows Azure blob storage for a given account.
☆17Oct 17, 2012Updated 13 years ago
steveloughran / zero-rename-committer
View on GitHub
Paper: A Zero-rename committer for object stores
☆20Nov 7, 2025Updated 8 months ago
stefan-hoeck / cyby2
View on GitHub
A library for writing chemical and biological data management systems
☆10Oct 24, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
irajhedayati / savro
View on GitHub
Some Avro operations in Scala
☆10Jun 29, 2026Updated last month
tony612 / kexplain
View on GitHub
Kexplain is an interactive kubectl explain
☆12Oct 23, 2023Updated 2 years ago
PeterMosmans / ansible-role-bootstrap
View on GitHub
Ansible role for bootstrapping a server installation
☆10Apr 12, 2022Updated 4 years ago
maropu / spark-sql-flow-plugin
View on GitHub
Visualize column-level data lineage in Spark SQL
☆92May 13, 2022Updated 4 years ago
cloudera / flink-basic-auth-handler
View on GitHub
flink-basic-auth-handler
☆14Mar 10, 2025Updated last year
47degrees / kotlin-for-scala-devs
View on GitHub
A brief presentation comparing Scala with Kotlin aimed toward Scala FP devs at 47 Degrees
☆40Dec 23, 2019Updated 6 years ago
carrot / terraform-api-gateway-method-module
View on GitHub
Ease the pain of resourcing an API Gateway method.
☆14Jan 12, 2017Updated 9 years ago
sukaiyi / skyutils
View on GitHub
A fast way to reach commonly used operation
☆10Feb 6, 2018Updated 8 years ago
supercoderz / redis_kernel
View on GitHub
A simple kernel to interact with a redis database from IPython
☆22Sep 25, 2017Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
uzh / fox
View on GitHub
A framework for PSL inference.
☆22Nov 9, 2015Updated 10 years ago
goji / context
View on GitHub
x/net/context bridge for Goji 1 (old). Try https://github.com/goji/goji instead!
☆16Jan 22, 2016Updated 10 years ago
EncodePanda / spark-workshop
View on GitHub
☆14May 23, 2017Updated 9 years ago
tdoehmen / hypoparsr
View on GitHub
☆27Jan 31, 2019Updated 7 years ago
lhbench / lhbench
View on GitHub
Lakehouse storage system benchmark
☆82Feb 22, 2023Updated 3 years ago
susimsek / keycloak-blockchain-user-federation
View on GitHub
Hyperledger Fabric Keycloak User Federation
☆22Jan 13, 2023Updated 3 years ago
NVIDIA / cudf-spark-tools
View on GitHub
User tools for Spark RAPIDS
☆70Jul 20, 2026Updated last week
xjieinfo / xjgo
View on GitHub
☆12Dec 7, 2021Updated 4 years ago
stephanecollot / sparkmon
View on GitHub
Spark Monitoring
☆14Feb 28, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Workiva / go-hystrimp
View on GitHub
An ergonomic implementation of Hystrix fault-tolerance principles for Go developers.
☆19Oct 16, 2017Updated 8 years ago
emmalanguage / emma
View on GitHub
A quotation-based Scala DSL for scalable data analysis.
☆65Jul 7, 2022Updated 4 years ago
StarRocks / fe-plugins-auditloader
View on GitHub
AuditLoader plugin for FE
☆14May 22, 2026Updated 2 months ago
blaze-init / spark-blaze-extension
View on GitHub
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
☆11Apr 23, 2022Updated 4 years ago
apache / datasketches-postgresql
View on GitHub
PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp
☆94May 15, 2026Updated 2 months ago
shuoGG1239 / SqlToXXX
View on GitHub
根据sql文件生成javabean和数据模型啥的
☆11Nov 23, 2025Updated 8 months ago
anon-real / ErgoAuctionHouse
View on GitHub
Decentralized auction on top of ERGO.
☆32May 17, 2023Updated 3 years ago
apache / kyuubi-client
View on GitHub
Client libraries of end users of Apache Kyuubi
☆11May 15, 2026Updated 2 months ago
that-recsys-lab / librec-auto
View on GitHub
Python wrapper for LibRec and other recommendation frameworks.
☆28Oct 11, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ttscoff / starter-book
View on GitHub
An online book.
☆11Jan 24, 2015Updated 11 years ago
elsevierlabs-os / NotebookDiscovery
View on GitHub
Notebook Discovery Tool for Databricks notebooks
☆19Jul 14, 2022Updated 4 years ago
tribbloid / shapesafe
View on GitHub
SHAPE/S∀F∃: static prover/type-checker for N-D array programming in Scala, a use case of intuitionistic type theory
☆32Sep 9, 2025Updated 10 months ago
trib3 / leakycauldron
View on GitHub
☆10Updated this week
datapolitan / lede_algorithms
View on GitHub
☆16May 8, 2017Updated 9 years ago
dm4ml / gate
View on GitHub
Drift detection module for machine learning pipelines.
☆25Jun 21, 2023Updated 3 years ago
wikibook / doodle-ccpp
View on GitHub
《두들낙서의 C/C++ 한꺼번에 배우기》 예제 코드
☆14May 24, 2021Updated 5 years ago