jackghm / VerticaLinks
All things Vertica
☆62Updated 10 years ago
Alternatives and similar repositories for Vertica
Users that are interested in Vertica are comparing it to the libraries listed below
Sorting:
- User Defined Extensions (UDX) to the Vertica Analytic Database☆119Updated 2 years ago
- ☆24Updated 9 years ago
- Vertica Kit☆69Updated 10 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- A command-line tool for launching Apache Spark clusters.☆647Updated 8 months ago
- Random implementation notes☆34Updated 12 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆260Updated last year
- An open-source, vendor-neutral data context service.☆160Updated 7 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Updated 9 years ago
- Hadoop output committers for S3☆111Updated 5 years ago
- Spark package for checking data quality☆221Updated 5 years ago
- Official native Python client for the Vertica Analytics Database.☆385Updated 2 weeks ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55Updated 8 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive☆23Updated last year
- Iceberg is a table format for large, slow-moving tabular data☆481Updated 2 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- Apache Drill Workshop☆19Updated 9 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆364Updated 8 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Updated 8 years ago
- [DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine☆109Updated 5 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Updated 10 years ago
- Scripts to validate that a cluster is ready for MapR Data Platform installation☆85Updated 5 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 5 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 7 years ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆613Updated 2 years ago
- DataPipeline for humans.☆249Updated 3 years ago
- Tool to generate a Hive schema from a JSON example doc☆227Updated 5 years ago