vitillo/spark-hyperloglog

Algebird's HyperLogLog support for Apache Spark.

☆10

Alternatives and similar repositories for spark-hyperloglog

Users who are interested in spark-hyperloglog are comparing it to the libraries listed below.

zmap / cachehash
An efficient, hash-table-like C data structure of static size that evicts the LRU object on insertion
☆11 · Sep 10, 2023 · Updated 2 years ago
unnipillai / techfest-building-serverless-datalake-on-aws
2019 AWS Summit workshop content
☆13 · Jul 2, 2019 · Updated 7 years ago
smarx / wazproxy
Wazproxy is an HTTP proxy written in Node.js that automatically signs requests to Windows Azure blob storage for a given account.
☆17 · Oct 17, 2012 · Updated 13 years ago
allwefantasy / godear
ServiceFramework example project
☆10 · Apr 2, 2016 · Updated 10 years ago
snowplow / dbt-snowplow-mobile
A fully incremental model that transforms raw mobile event data generated by the Snowplow mobile trackers into a series of derived table…
☆15 · Apr 7, 2026 · Updated 3 months ago
w3cloud / starwebprint
This sample JavaScript code demonstrates how easy it is to print to a Star Micronics thermal printer like TSP 654ii WebPRNT, using html…
☆26 · Feb 18, 2015 · Updated 11 years ago
maropu / datasketches-spark
Data Sketches for Apache Spark
☆22 · Dec 22, 2022 · Updated 3 years ago
lee2018jian / airbnb
☆20 · Aug 19, 2018 · Updated 7 years ago
learosema / ella-math
Basic Geometry and Linear Algebra library
☆16 · Feb 14, 2023 · Updated 3 years ago
BBVA / spark-benchmarks
Benchmarking suite for Apache Spark
☆16 · Nov 24, 2017 · Updated 8 years ago
jhole89 / aws-glue-sbt-quickstart
Example of how to set up SBT for local development of AWS Glue scripts
☆16 · Jan 4, 2021 · Updated 5 years ago
charlesb / CDF-workshop
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…
☆19 · Aug 16, 2019 · Updated 6 years ago
Factual / beercode-open
Open-source code backed by the Factual Beer Guarantee
☆17 · Nov 19, 2015 · Updated 10 years ago
GoldinGuy / K-Means-TS
💹 K-Means clustering implementation in TypeScript
☆22 · Jun 14, 2022 · Updated 4 years ago
fpmweb / footerMenu
Simple and easy jQuery plugin: a slide-up toggle footer menu shown when scrolling down.
☆13 · Apr 1, 2016 · Updated 10 years ago
slively / loopback-discover-models
Simple CLI to discover and write model definitions from an existing datasource.
☆21 · Jul 19, 2017 · Updated 9 years ago
chansen / c-timestamp
Fast RFC 3339 (ISO 8601) timestamp parser and formatter implemented in C, zero dependencies.
☆39 · Feb 13, 2014 · Updated 12 years ago
DavidButterfield / SCST-Usermode-Adaptation
Adaptation of iSCSI-SCST and DRBD software to run entirely in usermode
☆25 · Oct 1, 2019 · Updated 6 years ago
mozilla-services / tigerblood
Deprecated, use https://github.com/mozilla-services/iprepd
☆15 · May 18, 2018 · Updated 8 years ago
MrPowers / spark-test-example
Spark DataFrame transformation and UDF test examples
☆22 · Feb 13, 2023 · Updated 3 years ago
columbia / sunlight
☆20 · Feb 19, 2016 · Updated 10 years ago
dstaesse / ouroboros
Mirror of the Ouroboros packet network repository. The main repository is on Codeberg.
☆43 · May 15, 2026 · Updated 2 months ago
scify / JedAI-Spark
☆15 · Aug 11, 2022 · Updated 3 years ago
abrander / ginproxy
A very simple proxy handler for gin-gonic
☆12 · Feb 3, 2016 · Updated 10 years ago
codocedo / tane
Implementation of TANE for experimental purposes
☆15 · Apr 29, 2022 · Updated 4 years ago
dbist / oozie-examples
Sample Oozie workflows
☆17 · Jun 13, 2017 · Updated 9 years ago
edihasaj / universal-memory-protocol
Universal Memory Protocol (UMP) - an open standard for agent memory. The third interop layer beside MCP (tools) and A2A (coordination).
☆32 · Jul 10, 2026 · Updated last week
tylertreat / InverseBloomFilter
Concurrent inverse Bloom filter.
☆15 · Feb 3, 2015 · Updated 11 years ago
markt-asf / memory-leaks
Sample code for demonstrating and exploring class-loader-related memory leaks
☆15 · Mar 29, 2018 · Updated 8 years ago
spring-attic / spring-cloud-app-starters-maven-plugins
☆12 · Jul 8, 2022 · Updated 4 years ago
zlobendog / polars_encryption
A Polars plugin for encrypting and decrypting data using the AES-GCM-SIV algorithm in Rust
☆11 · Jan 8, 2025 · Updated last year
tvondra / ccnumber
Experimental PostgreSQL data type with encryption off-loaded to a trusted component
☆18 · Nov 23, 2018 · Updated 7 years ago
snowplow-archive / avalanche
Load testing for event analytics platforms (Snowplow, more coming soon)
☆13 · May 17, 2016 · Updated 10 years ago
huntbao / jsonbean
Extract datatypes from Java bean files using pegjs.
☆15 · Nov 28, 2019 · Updated 6 years ago
inconshreveable / service
Run Go programs as a service on major platforms.
☆16 · Aug 11, 2015 · Updated 10 years ago
RyZenoKelb / Terminux
🖥️ Modern web-based terminal simulator with authentic Ubuntu interface. Features complete file system, built-in text editor, file downlo…
☆15 · May 28, 2025 · Updated last year
xmlking / cdc-kafka-hadoop
MySQL to NoSQL real-time dataflow
☆19 · Oct 14, 2017 · Updated 8 years ago
jetoile / resteasy-netty-sample
Sample of resteasy-netty project
☆17 · Jun 25, 2015 · Updated 11 years ago
hbsun2113 / Airbnb_Interview
In 2019 I prepared for an interview at Airbnb Beijing; this repo includes the coding questions I solved.
☆36 · Feb 16, 2020 · Updated 6 years ago