Example project showing how to use Hive UDFs in Apache Spark
☆55Apr 23, 2019Updated 7 years ago
Alternatives and similar repositories for spark-hive-udf
Users that are interested in spark-hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Preparatory notes for the Cloudera Spark and Hadoop Certification☆18Dec 5, 2018Updated 7 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Mar 1, 2018Updated 8 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆29Apr 16, 2018Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- HiveDB is an open source project for horizontally partitioning MySQL systems.☆48Jun 21, 2022Updated 3 years ago
- Online LDA using Hoffman's Python Implementation☆15Nov 14, 2014Updated 11 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Jul 7, 2021Updated 4 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation☆23Apr 18, 2016Updated 10 years ago
- Some useful custom hive udf functions, especial array, json, math, string functions.☆227Jul 30, 2024Updated last year
- Facebook's Hive UDFs☆275Feb 3, 2026Updated 4 months ago
- ☆17Mar 19, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆19Apr 9, 2020Updated 6 years ago
- jq for Apache Hive☆22Oct 7, 2020Updated 5 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Sep 17, 2025Updated 9 months ago
- Data Science In Investment Banking☆22Sep 20, 2025Updated 8 months ago
- Scheduling Kubernetes Jobs in cluster and Virtual Kubelet☆11Nov 25, 2018Updated 7 years ago
- Stratosphere is now Apache Flink.☆201Dec 16, 2023Updated 2 years ago
- Generate mock data based on an Apache Avro schema and specific cardinality settings☆10Apr 16, 2018Updated 8 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tool for visualizing Apache Oozie pipelines☆13Feb 15, 2016Updated 10 years ago
- Codec for Hadoop adding OpenPGP encryption using Bouncy Castle☆17Aug 18, 2011Updated 14 years ago
- Salesforce plugins☆12Updated this week
- This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc…☆11Sep 2, 2019Updated 6 years ago
- ☆11May 16, 2022Updated 4 years ago
- Spark SQL UDF examples☆57Dec 17, 2017Updated 8 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- a list of links to help you make various important architectural decisions☆11Jul 13, 2016Updated 9 years ago
- A React component to implement continuous scrolling (for modern browser).☆17Jan 12, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Different entries to kaggle contests using Apache Spark☆13Jun 5, 2017Updated 9 years ago
- Cloudera Manager datasource for Grafana 3.x☆19Jun 28, 2023Updated 2 years ago
- 使用spring-boot-spark的一个样例☆11Aug 3, 2018Updated 7 years ago
- A walkthrough of setting up a Kinesis Data Analytics for Java Application which ingest streaming JSON data and leverages the Flink Table …☆16Aug 30, 2023Updated 2 years ago
- ☆35May 23, 2019Updated 7 years ago
- ☆15May 31, 2023Updated 3 years ago
- Anomaly detection system for Datadog multiple metrics☆23Nov 11, 2016Updated 9 years ago