vertica / Vertica-Extension-PackagesLinks
User Defined Extensions (UDX) to the Vertica Analytic Database
☆119Updated 2 years ago
Alternatives and similar repositories for Vertica-Extension-Packages
Users that are interested in Vertica-Extension-Packages are comparing it to the libraries listed below
Sorting:
- ☆24Updated 9 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 7 years ago
- All things Vertica☆62Updated 10 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆292Updated 2 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Updated 8 years ago
- Hive SerDe for CSV☆140Updated 4 years ago
- Spark package for checking data quality☆221Updated 5 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 7 years ago
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Updated 7 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive☆23Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year
- A Python Module to make it easy to script powerful interactions with Teradata Database in a DevOps friendly way.☆108Updated 3 years ago
- Vertica Kit☆69Updated 10 years ago
- Benchmark data warehouses under Fivetran-like conditions☆170Updated 2 years ago
- Official native Python client for the Vertica Analytics Database.☆384Updated last month
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 2 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55Updated 8 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- A testing framework for Presto☆63Updated 4 months ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆260Updated last year
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Tool to generate a Hive schema from a JSON example doc☆227Updated 5 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Updated last year