Approximate cardinality estimation with HyperLogLog, as a Hive function
☆42Dec 17, 2012Updated 13 years ago
Alternatives and similar repositories for hive-udf
Users that are interested in hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example using Grafana with Druid☆11Mar 27, 2015Updated 10 years ago
- An application to monitor and drive the Spark JobServer☆12Dec 12, 2014Updated 11 years ago
- Some Spark implementations of clustering algorithms.☆19Nov 13, 2018Updated 7 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- An experimental lossless data compression program with high compression ratio.☆15Feb 27, 2013Updated 13 years ago
- Searching for an honest classifier☆17Jan 14, 2016Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Sep 25, 2014Updated 11 years ago
- sorting algorithms benchmark☆14Aug 14, 2017Updated 8 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Fuzzing compression libraries☆20Jan 10, 2016Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Feb 11, 2017Updated 9 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 2 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆24Jan 25, 2016Updated 10 years ago
- ☆13May 12, 2021Updated 4 years ago
- Arithmetic coding library☆17Feb 1, 2026Updated last month
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Alfred Workflow & AppleScript to close all open system alerts (iCal, etc) without touching the mouse.☆24Oct 24, 2017Updated 8 years ago
- 基于人工神经网络的动漫角色人脸检测☆14May 15, 2017Updated 8 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 5 months ago
- Simple command line application to read/write message to kafka topic using protobuf☆14Mar 27, 2023Updated 2 years ago
- Use Celery (an asynchronous task queue) with a schedule to read a file and print☆12Sep 10, 2021Updated 4 years ago
- Legacy Snowplow website, switched off 25 April 2017☆16May 15, 2017Updated 8 years ago
- Adobe Experience Platform API for humans☆33Mar 18, 2026Updated last week
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 10 years ago
- a concurrent compiled programming language☆15Jun 9, 2022Updated 3 years ago
- Neural Arithmetic Logic Units(arXiv:1808.00508)☆11Aug 6, 2018Updated 7 years ago
- demo clients☆20Jul 31, 2017Updated 8 years ago
- 👑 Fully on-chain auto-battler game owned by the community☆18Jun 28, 2024Updated last year
- Common components used across the datamountaineer kafka connect connectors☆21Feb 12, 2021Updated 5 years ago
- Algorithmic Trading Pipeline for Online Betting Markets☆19Dec 7, 2022Updated 3 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆97Nov 14, 2019Updated 6 years ago
- A Ruby toolkit for cloud-friendly ETL☆38Jul 29, 2016Updated 9 years ago
- yggdrash proof of concept☆11Mar 22, 2018Updated 8 years ago
- java completion daemon☆30Jan 11, 2015Updated 11 years ago
- P2P Sports Betting on the Ethereum blockchain☆14Mar 14, 2025Updated last year