jgperrin / net.jgp.labs.spark.datasources
Building custom data sources for Apache Spark, in Java.
☆12 Updated 4 years ago
Related projects
Alternatives and complementary repositories for net.jgp.labs.spark.datasources
- Schema Registry integration for Apache Spark ☆39 Updated last year
- Spark Example using Phoenix to interact with HBase ☆16 Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 Updated 2 years ago
- Testbench for experimenting with Apache Hive at any data scale. ☆65 Updated 7 years ago
- ☆18 Updated 5 months ago
- Hadoop Data Pipeline using Falcon ☆15 Updated 8 years ago
- Sample processing code using Spark 2.1+ and Scala ☆51 Updated 4 years ago
- Developing Spark External Data Sources using the V2 API ☆46 Updated 6 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API ☆14 Updated 4 years ago
- Code examples for my blog posts ☆22 Updated 6 years ago
- A series of demos using HBase Standalone and Phoenix/HBase ☆19 Updated 9 years ago
- A tool to get better debug info on Spark's memory usage ☆42 Updated 5 years ago
- High performance HBase / Spark SQL engine ☆28 Updated 2 years ago
- HDFS Automatic Snapshot Service for Linux ☆12 Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool ☆12 Updated 7 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos ☆19 Updated 8 years ago
- Serde for Cobol Layout to Hive table ☆24 Updated 5 years ago
- Spark Structured Streaming via HTTP communication ☆18 Updated 2 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 Updated 7 years ago
- Integration of Iceberg table management into Spark SQL ☆11 Updated 4 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines ☆17 Updated 4 years ago
- Port of TPC-DS dsdgen to Java ☆47 Updated 3 months ago
- Cascading on Apache Flink® ☆54 Updated 9 months ago
- Tachyon service for Ambari ☆9 Updated 9 years ago
- Kafka Examples repository. ☆43 Updated 5 years ago
- Apache Spark ETL Utilities ☆40 Updated 3 weeks ago
- ☆23 Updated 6 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive ☆23 Updated last year
- Spark Structured Streaming State Tools ☆34 Updated 4 years ago