Approximate cardinality estimation with HyperLogLog, as a Hive function
☆42Dec 17, 2012Updated 13 years ago
Alternatives and similar repositories for hive-udf
Users that are interested in hive-udf are comparing it to the libraries listed below
- Example using Grafana with Druid☆11Mar 27, 2015Updated 10 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- Some Spark implementations of clustering algorithms.☆19Nov 13, 2018Updated 7 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 10 years ago
- Adaptive File Source Connector for Spark, optimised for reading from object stores☆15Oct 18, 2022Updated 3 years ago
- The Musketeer workflow manager.☆42Oct 30, 2018Updated 7 years ago
- Java code for Apache Nifi processors☆11Jun 5, 2017Updated 8 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- An experimental lossless data compression program with high compression ratio.☆15Feb 27, 2013Updated 13 years ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- Implementation of 'Recordinality' cardinality estimation sketch with distinct value sampling☆55Aug 20, 2013Updated 12 years ago
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- An application to monitor and drive the Spark JobServer☆12Dec 12, 2014Updated 11 years ago
- Docker Image - Tadpole DB Hub☆14Jul 28, 2021Updated 4 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Nested lists published on GitHub.☆13Sep 7, 2022Updated 3 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 9 years ago
- forza-telemetry-kafka-producer☆10May 2, 2022Updated 3 years ago
- Sandbox of Spark-notebook☆10Mar 19, 2016Updated 9 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 5 months ago
- JupyterLab Notebook for Mesosphere DC/OS☆11Aug 6, 2019Updated 6 years ago
- Spring Boot Starter for handling alerts from Prometheus Alertmanager☆11Mar 6, 2023Updated 2 years ago
- Java 8's core concepts are explained by examples.☆12Oct 12, 2018Updated 7 years ago
- Content Data Store (HDFS/HBase)☆13Dec 1, 2016Updated 9 years ago
- An expansive bundle of NiFi additions intended to be used for generating test data☆11Aug 6, 2023Updated 2 years ago
- Spawns JupyterHub single user servers in Marathon☆11Oct 8, 2017Updated 8 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Official GitHub repository of Mashr☆10May 20, 2019Updated 6 years ago
- LDAP to RestAPI Gateway Server☆12Dec 4, 2017Updated 8 years ago
- sorting algorithms benchmark☆14Aug 14, 2017Updated 8 years ago