Approximate cardinality estimation with HyperLogLog, as a Hive function
☆42Dec 17, 2012Updated 13 years ago
Alternatives and similar repositories for hive-udf
Users that are interested in hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example using Grafana with Druid☆11Mar 27, 2015Updated 11 years ago
- An application to monitor and drive the Spark JobServer☆11Dec 12, 2014Updated 11 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- ☆13Dec 25, 2020Updated 5 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark☆15Oct 6, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Searching for an honest classifier☆17Jan 14, 2016Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆24Sep 25, 2014Updated 11 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 3 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Feb 11, 2017Updated 9 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 3 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆24Jan 25, 2016Updated 10 years ago
- Advent of Code 2021 (Elixir + Pygame)☆16Apr 4, 2022Updated 4 years ago
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- Arithmetic coding library☆17Jun 8, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Alfred Workflow & AppleScript to close all open system alerts (iCal, etc) without touching the mouse.☆24Oct 24, 2017Updated 8 years ago
- hdfs client impl with pure rust☆19Jan 9, 2024Updated 2 years ago
- Read SparkSQL parquet file as RDD[Protobuf]☆93Oct 12, 2018Updated 7 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 8 months ago
- Use Celery (an asynchronous task queue) with a schedule to read a file and print☆12Sep 10, 2021Updated 4 years ago
- SBT plugin for creating and managing AWS CloudFormation stacks☆11Jan 8, 2018Updated 8 years ago
- Adobe Experience Platform API for humans☆35Jun 2, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Reading and writing data types of arbitrary bit length that might not be byte-aligned☆22May 21, 2025Updated last year
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- demo clients☆20Jul 31, 2017Updated 8 years ago
- 👑 Fully on-chain auto-battler game owned by the community☆18Jun 28, 2024Updated last year
- Common components used across the datamountaineer kafka connect connectors☆21Feb 12, 2021Updated 5 years ago
- Algorithmic Trading Pipeline for Online Betting Markets☆19Dec 7, 2022Updated 3 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆97Nov 14, 2019Updated 6 years ago
- java completion daemon☆30Jan 11, 2015Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- P2P Sports Betting on the Ethereum blockchain☆14Mar 14, 2025Updated last year
- Provides users a possibility to train their own GOTURN tracker CNN model on custom data☆11Jul 27, 2017Updated 8 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- This convolutional neural network can say if an input image is a chat screen-shot or a normal image☆22Mar 20, 2021Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago