vertica / Vertica-Extension-Packages
User Defined Extensions (UDX) to the Vertica Analytic Database
☆119 · Updated 2 years ago
Alternatives and similar repositories for Vertica-Extension-Packages:
Users who are interested in Vertica-Extension-Packages are comparing it to the libraries listed below.
- ☆24 · Updated 9 years ago
- All things Vertica ☆62 · Updated 10 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics. ☆110 · Updated 2 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers. ☆72 · Updated 8 years ago
- Spark package for checking data quality ☆221 · Updated 5 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 · Updated 5 years ago
- NexR Hive UDFs ☆111 · Updated 9 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 · Updated 7 years ago
- PrestoClient implements the client protocol to communicate with a Presto server. There are versions in C, Python and R. ☆48 · Updated 9 years ago
- Remedy small files by combining them into larger ones. ☆193 · Updated 2 years ago
- Build configuration-driven ETL pipelines on Apache Spark ☆159 · Updated 2 years ago
- ☆63 · Updated 5 years ago
- JSON schema parser for Apache Spark ☆81 · Updated 2 years ago
- Kafka as Hive Storage ☆66 · Updated 10 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows. ☆54 · Updated 9 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive ☆23 · Updated last year
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu. ☆50 · Updated 8 years ago
- Support Highcharts in Apache Zeppelin ☆81 · Updated 7 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine ☆292 · Updated last year
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 · Updated 4 years ago
- Example for an airflow plugin ☆49 · Updated 8 years ago
- Vertica Kit ☆69 · Updated 9 years ago
- Lightweight Azkaban client ☆77 · Updated 5 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames ☆57 · Updated last year
- Port of TPC-DS dsdgen to Java ☆48 · Updated 7 months ago
- Random implementation notes ☆33 · Updated 11 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media) ☆22 · Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 · Updated last year
- Hive UDFs for funnel analysis ☆83 · Updated last year
- ☆209 · Updated 8 years ago