vertica / Vertica-Extension-PackagesLinks
User Defined Extensions (UDX) to the Vertica Analytic Database
☆119Updated 2 years ago
Alternatives and similar repositories for Vertica-Extension-Packages
Users that are interested in Vertica-Extension-Packages are comparing it to the libraries listed below
Sorting:
- ☆24Updated 9 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆292Updated 2 years ago
- All things Vertica☆62Updated 10 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 2 years ago
- Hive SerDe for CSV☆140Updated 4 years ago
- Spark package for checking data quality☆221Updated 5 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 8 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive☆23Updated last year
- Remedy small files by combining them into larger ones.☆193Updated 3 years ago
- Lightweight Azkaban client☆77Updated 5 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- Benchmark data warehouses under Fivetran-like conditions☆170Updated 2 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55Updated 8 years ago
- PyAthenaJDBC is an Amazon Athena JDBC driver wrapper for the Python DB API 2.0 (PEP 249).☆95Updated last year
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Updated 9 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Example for an airflow plugin☆49Updated 9 years ago
- Airflow declarative DAGs via YAML☆132Updated last year
- Cloudera Director sample code☆61Updated 5 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆136Updated 2 years ago
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Official native Python client for the Vertica Analytics Database.☆384Updated 8 months ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆154Updated last year
- A testing framework for Presto☆62Updated 2 months ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 6 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- A free electronic book about Apache Hive. The book is geared towards SQL-knowledgeable business users with some advanced tips for devops.…☆103Updated 7 years ago