Approximate cardinality estimation with HyperLogLog, as a Hive function
☆42Dec 17, 2012Updated 13 years ago
Alternatives and similar repositories for hive-udf
Users that are interested in hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example using Grafana with Druid☆11Mar 27, 2015Updated 11 years ago
- An application to monitor and drive the Spark JobServer☆11Dec 12, 2014Updated 11 years ago
- Implementation of 'Recordinality' cardinality estimation sketch with distinct value sampling☆55Aug 20, 2013Updated 12 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- Parallel Weighted Random Sampling☆21Dec 9, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 在线阅读本项目已翻译手册页☆11Aug 11, 2015Updated 10 years ago
- Searching for an honest classifier☆17Jan 14, 2016Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆24Sep 25, 2014Updated 11 years ago
- Useful reusable pipeline components for Crunch jobs☆27Feb 10, 2015Updated 11 years ago
- Fuzzing compression libraries☆20Jan 10, 2016Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Feb 11, 2017Updated 9 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 2 years ago
- ☆13May 12, 2021Updated 5 years ago
- Arithmetic coding library☆17Apr 15, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- hdfs client impl with pure rust☆19Jan 9, 2024Updated 2 years ago
- 基于人工神经网络的动漫角色人脸检测☆14May 15, 2017Updated 9 years ago
- Cloudera Manager parcel and CSD to manage Cassandra NoSQL database☆14Nov 16, 2016Updated 9 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 7 months ago
- Simple command line application to read/write message to kafka topic using protobuf☆14Mar 27, 2023Updated 3 years ago
- Legacy Snowplow website, switched off 25 April 2017☆15May 15, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A mysql plugin to to use the hyperloglog algorithm☆27Nov 6, 2016Updated 9 years ago
- Some tools for working with digraphs, partial orders and topological sorting with Python☆12Sep 7, 2011Updated 14 years ago
- HyperLogLog (original and hyperloglog++) algorithm implementation in java.☆82Mar 9, 2021Updated 5 years ago
- Neural Arithmetic Logic Units(arXiv:1808.00508)☆11Aug 6, 2018Updated 7 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- demo clients☆20Jul 31, 2017Updated 8 years ago
- 👑 Fully on-chain auto-battler game owned by the community☆18Jun 28, 2024Updated last year
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- A Ruby toolkit for cloud-friendly ETL☆37Jul 29, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- yggdrash proof of concept☆11Mar 22, 2018Updated 8 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆97Nov 14, 2019Updated 6 years ago
- LR and FM (with sgd or ftrl) model☆25Jun 1, 2016Updated 9 years ago
- Provides users a possibility to train their own GOTURN tracker CNN model on custom data☆11Jul 27, 2017Updated 8 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago