vertica / Vertica-Extension-PackagesLinks
User Defined Extensions (UDX) to the Vertica Analytic Database
☆119Updated 3 years ago
Alternatives and similar repositories for Vertica-Extension-Packages
Users that are interested in Vertica-Extension-Packages are comparing it to the libraries listed below
Sorting:
- ☆25Updated 10 years ago
- All things Vertica☆62Updated 10 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Updated 8 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 8 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Hive SerDe for CSV☆140Updated 4 years ago
- Official native Python client for the Vertica Analytics Database.☆384Updated last week
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆293Updated 2 years ago
- Ambari YARN UTILS☆30Updated 2 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Benchmark data warehouses under Fivetran-like conditions☆171Updated 2 years ago
- Spark package for checking data quality☆222Updated 5 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Updated 7 years ago
- Example for an airflow plugin☆49Updated 9 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Updated 2 years ago
- SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs☆125Updated 7 years ago
- Tool to generate a Hive schema from a JSON example doc☆228Updated 6 years ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆326Updated 4 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55Updated 8 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- NexR Hive UDFs☆113Updated 10 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive☆23Updated 2 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- DataPipeline for humans.☆250Updated 3 years ago
- PrestoClient implements the client protocol to communicate with a Presto server. There are versions in C, Python and R.☆48Updated 10 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 3 years ago