cartershanklin / csv-to-orc
Convert a CSV file to ORCFile
☆26 Updated 5 years ago
Alternatives and similar repositories for csv-to-orc:
Users who are interested in csv-to-orc are comparing it to the libraries listed below.
- Presto K8S Operator ☆9 Updated 4 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆71 Updated 2 years ago
- Auto-archived due to inactivity. Simple JVM Profiler Using StatsD and Other Metrics Backends ☆15 Updated last year
- A Spark metrics sink that pushes to InfluxDb ☆51 Updated 4 years ago
- Cloudbreak Deployer Tool ☆34 Updated last year
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 Updated 2 years ago
- Ansible playbooks for Apache Spark on kube ☆27 Updated 7 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security. ☆67 Updated 4 years ago
- Java event logs collector for hadoop and frameworks ☆39 Updated 3 weeks ago
- Cloudera Manager datasource for Grafana 3.x ☆19 Updated last year
- Ansible playbooks to construct distributed computing environments ☆62 Updated 3 years ago
- Schema Registry integration for Apache Spark ☆40 Updated 2 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆62 Updated last year
- These are some code examples ☆55 Updated 5 years ago
- ☆9 Updated 9 years ago
- Utilities for writing tests that use Apache Spark. ☆24 Updated 6 years ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari. ☆14 Updated 9 years ago
- type-class based data cleansing library for Apache Spark SQL ☆79 Updated 5 years ago
- On demand presto cluster with mesos, marathon and docker. ☆30 Updated 6 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 Updated 3 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- Sample processing code using Spark 2.1+ and Scala ☆51 Updated 4 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications ☆35 Updated 2 months ago
- High performance HBase / Spark SQL engine ☆28 Updated 2 years ago
- ☆26 Updated 5 years ago
- ☆9 Updated 9 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple… ☆26 Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 Updated 2 years ago
- Allows wrapping existing WebUI pages and present them as Ambari Views ☆9 Updated 9 years ago
- Spark stream from kafka(json) to s3(parquet) ☆15 Updated 6 years ago