arangodb / arangodb-spark-datasourceLinks
ArangoDB Connector for Apache Spark, using the Spark DataSource API
☆14Updated last month
Alternatives and similar repositories for arangodb-spark-datasource
Users that are interested in arangodb-spark-datasource are comparing it to the libraries listed below
Sorting:
- Spark Streaming Checkpoint File Manager for MinIO☆11Updated 2 years ago
- Neo4j foreign data wrapper for Postgresql☆57Updated 3 years ago
- Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from…☆35Updated 2 years ago
- Multiple node presto cluster on docker container☆125Updated 3 years ago
- spark-drools tutorials☆16Updated last year
- Neo4j Kafka Connector☆176Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated this week
- Open source graph visualiser.☆178Updated 3 months ago
- ODD Specification is a universal open standard for collecting metadata.☆143Updated 10 months ago
- Jupyter Integration for Flink SQL via Ververica Platform☆43Updated 2 years ago
- Yet another JanusGraph, Cassandra/Scylla and Elasticsearch in Docker Compose setup☆58Updated 5 years ago
- Apache Flink Stateful Functions Playground☆130Updated last year
- An Example Dremio ARP driven connector that supports SQLLite☆19Updated last year
- TypeScript client library for Trino☆40Updated this week
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- A scale demo of Neo4j Fabric spanning up to 1129 machines/shards running a 100TB (LDBC) dataset with 1.2tn nodes and relationships.☆95Updated 7 months ago
- Demos of Materialize, the operational data warehouse.☆52Updated 6 months ago
- ☆20Updated last year
- Code for the fictitious food delivery company GottaEat used in the Pulsar In Action book☆18Updated 3 years ago
- A connector for Apache Spark and PySpark to Dgraph databases.☆43Updated 4 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Hadoop, Hive and PrestoDB for deployment using Docker☆27Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 2 years ago
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated 2 months ago
- Pulsar Beam is a streaming service via HTTP built on Apache Pulsar.☆60Updated 3 years ago
- A curated list to help you manage temporal data across many modalities 🚀.☆116Updated 2 years ago
- 🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka☆337Updated last week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated last week
- GraphQL service for python dataframes and parquet datasets.☆89Updated last week
- ONgDB is an independent fork of Neo4j® Enterprise Edition version 3.4.0.rc02 licensed under AGPLv3 and/or Community Edition licensed unde…☆421Updated last month