Example project showing how to use Hive UDFs in Apache Spark
☆55Apr 23, 2019Updated 6 years ago
Alternatives and similar repositories for spark-hive-udf
Users that are interested in spark-hive-udf are comparing it to the libraries listed below
Sorting:
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation☆23Apr 18, 2016Updated 9 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Aug 27, 2019Updated 6 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 9 years ago
- Spark(multi versions) + Streaming/Hive/SQL/UDF Demos☆15May 17, 2018Updated 7 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆30Apr 16, 2018Updated 7 years ago
- Oozie - workflow engine for Hadoop☆17Jul 8, 2020Updated 5 years ago
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆19Jun 22, 2021Updated 4 years ago
- Some useful custom hive udf functions, especial array, json, math, string functions.☆227Jul 30, 2024Updated last year
- ☆17Mar 19, 2024Updated last year
- Following along with the Hive tutorial at StrataConf / HadoopWorld☆22Mar 22, 2019Updated 6 years ago
- Preparatory notes for the Cloudera Spark and Hadoop Certification☆18Dec 5, 2018Updated 7 years ago
- 日本語版wordnetをPythonで扱うためのラッパー☆26Jan 20, 2014Updated 12 years ago
- Facebook's Hive UDFs☆277Feb 3, 2026Updated last month
- Spark SQL UDF examples☆56Dec 17, 2017Updated 8 years ago
- ☆35May 23, 2019Updated 6 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Sep 4, 2023Updated 2 years ago
- ☆20Feb 28, 2018Updated 8 years ago
- Hadoop FSImage Analyzer (HFSA)☆66Mar 2, 2026Updated last week
- Mirror of Apache Hivemall (incubating)☆313Sep 6, 2022Updated 3 years ago
- Stratosphere is now Apache Flink.☆199Dec 16, 2023Updated 2 years ago
- Capture changes of HBase to Kafka☆30May 3, 2016Updated 9 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Nov 16, 2022Updated 3 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 9 years ago
- ☆36Apr 9, 2025Updated 11 months ago
- NexR Hive UDFs☆113Aug 5, 2015Updated 10 years ago
- Android TV automation framework via ADB command shell and python☆11Sep 30, 2018Updated 7 years ago
- Apache Geode on Kubernetes☆10Oct 19, 2019Updated 6 years ago
- 适合2到6岁的宝宝打字游戏☆10May 29, 2020Updated 5 years ago
- Raspberry Pi Turta röle kartını görsel arayüz üzerinden kontrol eden python dili ile yazılmış program☆11Nov 30, 2016Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- Interactive data analysis with Pandas and Treasure Data.☆37Mar 25, 2020Updated 5 years ago
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated 11 months ago
- Interplanetary Database: A Database built on top of IPFS and made immutable using Ethereum blockchain.☆10Sep 19, 2022Updated 3 years ago