G-Research / spark-dgraph-connector
A connector for Apache Spark and PySpark to Dgraph databases.
☆43Updated last month
Related projects: ⓘ
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 4 months ago
- Apache datasketches☆85Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆37Updated last year
- Simple project to expose a catalog over REST using a Java catalog backend☆103Updated this week
- Spark Connector to read and write with Pulsar☆111Updated 5 months ago
- Apache Flink Stateful Functions Playground☆127Updated 11 months ago
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- ☆104Updated last year
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated last year
- A library for Spark DataFrame using MinIO Select API☆96Updated 4 years ago
- ☆33Updated last year
- Kubernetes (K8s) Operator for PrestoDB☆45Updated 2 years ago
- Storage connector for Trino☆90Updated 3 weeks ago
- NebulaGraph Exchange is an Apache Spark application to parse data from different sources to NebulaGraph in a distributed environment. It …☆28Updated 2 months ago
- Parsing, AST and semantic analysis for the Cypher Query Language☆53Updated 2 months ago
- ☆28Updated last month
- Gateway to provide HTTP endpoints for the Nebula Graph service.☆26Updated 8 months ago
- Spark SQL listener to record lineage information☆28Updated 3 years ago
- Visualize column-level data lineage in Spark SQL☆85Updated 2 years ago
- Command line interface for the Nebula Graph service☆57Updated 3 months ago
- ☆66Updated 8 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated 10 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 3 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆104Updated this week
- Data exporter of Nebula Graph☆17Updated 3 weeks ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆91Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆72Updated 2 months ago