Approximate cardinality estimation with HyperLogLog, as a Hive function
☆42Dec 17, 2012Updated 13 years ago
Alternatives and similar repositories for hive-udf
Users that are interested in hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example using Grafana with Druid☆11Mar 27, 2015Updated 11 years ago
- An application to monitor and drive the Spark JobServer☆12Dec 12, 2014Updated 11 years ago
- Implementation of 'Recordinality' cardinality estimation sketch with distinct value sampling☆55Aug 20, 2013Updated 12 years ago
- Some Spark implementations of clustering algorithms.☆19Nov 13, 2018Updated 7 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Parallel Weighted Random Sampling☆20Dec 9, 2020Updated 5 years ago
- Searching for an honest classifier☆17Jan 14, 2016Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Sep 25, 2014Updated 11 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Fuzzing compression libraries☆20Jan 10, 2016Updated 10 years ago
- Quick and dirty script to automate checkin to a Southwest flight.☆25Jun 22, 2018Updated 7 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 2 years ago
- ☆13May 12, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- hdfs client impl with pure rust☆18Jan 9, 2024Updated 2 years ago
- Read SparkSQL parquet file as RDD[Protobuf]☆93Oct 12, 2018Updated 7 years ago
- Cloudera Manager parcel and CSD to manage Cassandra NoSQL database☆14Nov 16, 2016Updated 9 years ago
- Simple command line application to read/write message to kafka topic using protobuf☆14Mar 27, 2023Updated 3 years ago
- LittleBit is a pure Huffman coding compression algorithm with the option of random access reading while offering competitive compression …☆14Jul 4, 2021Updated 4 years ago
- Legacy Snowplow website, switched off 25 April 2017☆16May 15, 2017Updated 8 years ago
- Adobe Experience Platform API for humans☆35Apr 1, 2026Updated last week
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于hanny的友情链接插件做修改,使其在Material Theme中保持原有风格☆11Sep 23, 2017Updated 8 years ago
- A vim plugin to query Stack Overflow☆25May 17, 2022Updated 3 years ago
- Common components used across the datamountaineer kafka connect connectors☆21Feb 12, 2021Updated 5 years ago
- A Ruby toolkit for cloud-friendly ETL☆38Jul 29, 2016Updated 9 years ago
- java completion daemon☆30Jan 11, 2015Updated 11 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- This convolutional neural network can say if an input image is a chat screen-shot or a normal image☆22Mar 20, 2021Updated 5 years ago
- FastAPI authorization middleware based on PyCasbin☆21Mar 30, 2026Updated 2 weeks ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago
- Rust implementation of Apache ORC☆29Apr 3, 2026Updated last week
- 🐍本项目为 Consul 的使用 Demo☆13Dec 8, 2022Updated 3 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- DEMO程序,可参考分享组件常用接口方法☆23Feb 10, 2015Updated 11 years ago
- Nested lists published on GitHub.☆13Sep 7, 2022Updated 3 years ago