MLnick/hive-udf

Approximate cardinality estimation with HyperLogLog, as a Hive function

☆42

Alternatives and similar repositories for hive-udf

Users who are interested in hive-udf are comparing it to the libraries listed below.

edwardcapriolo / hive-geoip
GeoIP Functions for hive
☆49 · Oct 13, 2020 · Updated 5 years ago
edwardcapriolo / hive-protobuf
Protobuf input format and Serde support
☆18 · Mar 2, 2013 · Updated 13 years ago
edwardcapriolo / IronCount
☆34 · Jan 13, 2019 · Updated 7 years ago
XDgov / weehive
A minimal Apache Hive server in a Docker image
☆13 · Dec 24, 2020 · Updated 5 years ago
quantiply / grafana-druid-wikipedia
Example using Grafana with Druid
☆11 · Mar 27, 2015 · Updated 11 years ago
jatrost / hadoop-binary-analysis
Framework that makes processing arbitrary binary data in Hadoop easier
☆22 · Apr 8, 2013 · Updated 13 years ago
ceteri / slinky
Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi
☆40 · Aug 30, 2010 · Updated 15 years ago
josephxsxn / moya
Memcached on YARN
☆19 · Jun 2, 2014 · Updated 12 years ago
jbooth / maggiefs
distributed read/write filesystem in go, bound to local mountpoint using go-fuse
☆25 · Oct 5, 2015 · Updated 10 years ago
livingsocial / HiveSwarm
Helpful user defined functions / table generating functions for Hive
☆102 · May 2, 2016 · Updated 10 years ago
ogrodnek / spark-plug
scala driver for launching Amazon EMR jobs
☆40 · Feb 10, 2016 · Updated 10 years ago
cscotta / recordinality
Implementation of 'Recordinality' cardinality estimation sketch with distinct value sampling
☆56 · Aug 20, 2013 · Updated 12 years ago
kawaa / Beetest
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
☆73 · Feb 11, 2017 · Updated 9 years ago
UrbanOS-Public / kdp
Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
☆17 · Oct 20, 2022 · Updated 3 years ago
RobinUS2 / presto-bloomfilter
Bloomfilter support for Facebook Presto (prestodb.io)
☆26 · Jul 7, 2022 · Updated 4 years ago
gruter / cloumon-oozie
oozie designer and job management system
☆22 · Sep 25, 2012 · Updated 13 years ago
LinkedInAttic / white-elephant
Hadoop log aggregator and dashboard
☆190 · Oct 29, 2013 · Updated 12 years ago
mintDS / mintds
Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…
☆24 · Jan 25, 2016 · Updated 10 years ago
edwardcapriolo / filecrush
Remedy small files by combining them into larger ones.
☆196 · Jul 1, 2022 · Updated 4 years ago
addthis / stream-lib
Stream summarizer and cardinality estimator.
☆2,265 · Nov 28, 2019 · Updated 6 years ago
pcmanus / cassandra
Mirror of Apache Cassandra (incubating)
☆15 · Mar 5, 2025 · Updated last year
lintool / SparkTutorial
Spark Tutorial at the University of Maryland
☆37 · Oct 24, 2014 · Updated 11 years ago
zaxtax / infer.py
☆17 · Jun 27, 2013 · Updated 13 years ago
ianozsvald / pycon2013_applied_parallel_computing
Applied Parallel Computing tutorial material for PyCon 2013 (Minesh Amin, Ian Ozsvald)
☆17 · Apr 2, 2013 · Updated 13 years ago
dk-stationery / stationery-ink
Distributed SQL-based realtime streaming computation framework on Apache Storm, Spark
☆12 · Mar 14, 2016 · Updated 10 years ago
edwardcapriolo / nibiru
A NoSql database designed for maximum pluggability and configurability.
☆18 · Jul 30, 2025 · Updated 11 months ago
jdye64 / docker-hwx
Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components
☆10 · Oct 11, 2019 · Updated 6 years ago
ceteri / ceteri-mapred
MapReduce examples
☆20 · Nov 18, 2011 · Updated 14 years ago
alaiacano / dsp-scalding
Code from my talk on Digital Signal Processing in Hadoop with Scalding
☆15 · Oct 17, 2013 · Updated 12 years ago
coreylynch / sklearn-transform
Collection of scripts for doing common transformations in machine learning
☆21 · Dec 5, 2012 · Updated 13 years ago
wg / crypto
High-performance cryptography for the JVM
☆18 · Jul 17, 2013 · Updated 13 years ago
kijiproject / kiji-express
☆16 · Sep 26, 2014 · Updated 11 years ago
brightcove-archive / ooyala_scamr
A Hadoop map reduce framework for Scala.
☆15 · Apr 21, 2016 · Updated 10 years ago
edwardcapriolo / hive_test
Unit test framework for hive and hive-service
☆65 · Jun 29, 2022 · Updated 4 years ago
s3u / ebay-srp-play
A demo app used for benchmarking Play Framework against Nodejs
☆17 · Mar 29, 2011 · Updated 15 years ago
flumebase / flumebase
Continuous Streaming SQL Queries for Flume
☆96 · Dec 30, 2011 · Updated 14 years ago
paulmw / hive-udf
☆16 · Apr 17, 2014 · Updated 12 years ago
Ctrip-DI / Hue-Ctrip-DI
Ctrip Data Infrastructure team works for hue
☆16 · Dec 10, 2014 · Updated 11 years ago
jpplayer / hdfs-auto-snapshot
HDFS Automatic Snapshot Service for Linux
☆11 · Oct 17, 2016 · Updated 9 years ago