inquidia / ParquetPluginLinks
☆13Updated 9 years ago
Alternatives and similar repositories for ParquetPlugin
Users that are interested in ParquetPlugin are comparing it to the libraries listed below
Sorting:
- ☆103Updated 5 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆265Updated 2 years ago
- Data Lineage Tracking And Visualization Solution☆645Updated last week
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆295Updated 2 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆283Updated this week
- Example to create lineage in Atlas with sqoop and spark☆14Updated 8 years ago
- ☆23Updated 7 years ago
- StreamSets Tutorials☆351Updated last year
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆282Updated 7 years ago
- DataQuality for BigData☆144Updated last year
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- A collection of templates for use with Apache NiFi.☆280Updated 8 years ago
- Kite SDK☆393Updated 3 years ago
- Spark connector for SFTP☆98Updated 2 years ago
- Mirror of Apache Atlas (Incubating)☆95Updated 2 years ago
- Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version☆271Updated last year
- Mirror of Apache Bahir☆335Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆179Updated 3 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆240Updated 10 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 3 years ago
- Schema Registry☆17Updated last year
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.☆117Updated last week
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 8 years ago
- docker for apache-atlas embedded-cassandra-solr☆23Updated 6 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆1,114Updated 2 years ago
- Ambari service for Apache Flink☆127Updated 4 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆280Updated 2 years ago
- Apache NiFi example flows☆209Updated 5 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆292Updated 2 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆302Updated last week