jackghm / Vertica
All things Vertica
☆62 Updated 10 years ago
Alternatives and similar repositories for Vertica:
Users who are interested in Vertica are comparing it to the repositories listed below.
- Vertica Kit ☆69 Updated 9 years ago
- ☆24 Updated 9 years ago
- User Defined Extensions (UDX) to the Vertica Analytic Database ☆119 Updated 2 years ago
- kafka-connect-s3: Ingest data from Kafka to Object Stores (s3) ☆95 Updated 5 years ago
- Google BigQuery support for Spark, SQL, and DataFrames ☆155 Updated 5 years ago
- An open-source, vendor-neutral data context service. ☆159 Updated 7 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 Updated 5 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆95 Updated 5 years ago
- Hadoop output committers for S3 ☆108 Updated 4 years ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆135 Updated 3 years ago
- Fork of Cloudera Impala separated from Hadoop ☆42 Updated 8 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers. ☆72 Updated 8 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters ☆83 Updated 5 years ago
- vertica dialect for sqlalchemy ☆12 Updated 9 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 Updated last year
- Spark SQL index for Parquet tables ☆134 Updated 3 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 Updated 4 years ago
- Dockerfile to build image of Vertica Community Edition. ☆21 Updated 7 years ago
- Random implementation notes ☆33 Updated 11 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆242 Updated 10 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner) ☆88 Updated 8 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- Vagrant files creating multi-node virtual Hadoop clusters with or without security. ☆67 Updated 4 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆185 Updated 2 years ago
- Cache File System optimized for columnar formats and object stores ☆183 Updated 2 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆261 Updated last year
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 Updated 7 years ago
- Performant Redshift data source for Apache Spark ☆138 Updated 2 months ago
- Spark package for checking data quality ☆221 Updated 5 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit… ☆283 Updated 6 years ago