Example project showing how to use Hive UDFs in Apache Spark
☆55Apr 23, 2019Updated 7 years ago
Alternatives and similar repositories for spark-hive-udf
Users that are interested in spark-hive-udf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Oozie - workflow engine for Hadoop☆17Jul 8, 2020Updated 5 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- HiveDB is an open source project for horizontally partitioning MySQL systems.☆47Jun 21, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Online LDA using Hoffman's Python Implementation☆15Nov 14, 2014Updated 11 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Aug 27, 2019Updated 6 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆589Jun 26, 2024Updated last year
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation☆23Apr 18, 2016Updated 10 years ago
- Sample code with integration between Data Catalog and Hive data source.☆24Jan 29, 2025Updated last year
- Task Metrics Explorer☆14Apr 2, 2019Updated 7 years ago
- Facebook's Hive UDFs☆277Feb 3, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- ☆17Mar 19, 2024Updated 2 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- Notes from 100 days with Kubernetes☆31Jan 25, 2019Updated 7 years ago
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆19Jun 22, 2021Updated 4 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers☆69Sep 17, 2025Updated 7 months ago
- Scheduling Kubernetes Jobs in cluster and Virtual Kubelet☆11Nov 25, 2018Updated 7 years ago
- Stratosphere is now Apache Flink.☆201Dec 16, 2023Updated 2 years ago
- Generate mock data based on an Apache Avro schema and specific cardinality settings☆10Apr 16, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- A boilerplate for DBI drivers, fully DBI-compliant☆11Updated this week
- 平时玩hadoop做的例子。☆10Feb 15, 2017Updated 9 years ago
- Tool for visualizing Apache Oozie pipelines☆13Feb 15, 2016Updated 10 years ago
- MonetDB as a shared library with a C API☆32Jan 13, 2022Updated 4 years ago
- Spark SQL UDF examples☆57Dec 17, 2017Updated 8 years ago
- A React component to implement continuous scrolling (for modern browser).☆17Jan 12, 2017Updated 9 years ago
- Different entries to kaggle contests using Apache Spark☆13Jun 5, 2017Updated 8 years ago
- Python client for Hadoop® YARN API☆109Sep 26, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Hudi Demo Notebook☆11Mar 5, 2024Updated 2 years ago
- 🐶 backed by 🔵 with 💙☆16Jun 4, 2021Updated 4 years ago
- A Spark job that outputs the ranked list of episode to watch based on one preference "keyword"☆14Mar 30, 2018Updated 8 years ago
- ☆35May 23, 2019Updated 6 years ago
- Anomaly detection system for Datadog multiple metrics☆23Nov 11, 2016Updated 9 years ago
- 日本語版wordnetをPythonで扱うためのラッパー☆26Jan 20, 2014Updated 12 years ago
- A fast segmented stack allocator for Rust, supporting multiple objects of any type.☆11Sep 4, 2020Updated 5 years ago