assafmendelson / DataSourceV2
☆21 · Updated 5 years ago
Related projects:
- A tool to get better debug info on Spark's memory usage ☆42 · Updated 5 years ago
- Developing Spark External Data Sources using the V2 API ☆46 · Updated 6 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Updated 2 years ago
- Spark Structured Streaming State Tools ☆34 · Updated 4 years ago
- Custom state store providers for Apache Spark ☆93 · Updated 2 years ago
- Spark SQL index for Parquet tables ☆132 · Updated 3 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 · Updated 3 years ago
- Connector between Spark and InfluxDB. ☆23 · Updated 8 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 · Updated 6 years ago
- Spark Structured Streaming via HTTP communication ☆18 · Updated 2 years ago
- Big Data Toolkit for the JVM ☆145 · Updated 3 years ago
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes. ☆115 · Updated 3 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines ☆17 · Updated 4 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization ☆47 · Updated 7 years ago
- RocksDB state storage implementation for Structured Streaming. ☆16 · Updated 3 years ago
- Edit code in IntelliJ, eval/run in Zeppelin notebook ☆18 · Updated 5 years ago
- An HBase backed Journal for Akka's experimental persistence / event-sourcing ☆47 · Updated 9 years ago
- Apache Calcite Tutorial ☆33 · Updated 8 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆28 · Updated 4 years ago
- A streaming key-value store implementation using native Flink Streaming operators ☆22 · Updated 8 years ago
- How to use Parquet in Flink ☆32 · Updated 7 years ago
- Data model generator based on Scala case classes ☆28 · Updated 3 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆97 · Updated 3 years ago
- An experiment to inject a customized parser using SparkSessionExtension ☆17 · Updated 6 years ago
- Simple JVM Profiler Using StatsD and Other Metrics Backends ☆15 · Updated 11 months ago
- ☆50 · Updated 3 years ago
- Deterministic transactional database layer on top of a stream processing engine ☆25 · Updated 4 years ago
- Spark Structured Streaming with Kafka data source and writing to Cassandra ☆64 · Updated 4 years ago
- Druid indexing plugin for using Spark in batch jobs ☆101 · Updated 2 years ago
- Schema Registry integration for Apache Spark ☆39 · Updated last year