DataSQRL / sqrlLinks
Data Streaming Framework to build data APIs, data lakes, and LLM tooling with SQL.
☆114Updated this week
Alternatives and similar repositories for sqrl
Users that are interested in sqrl are comparing it to the libraries listed below
Sorting:
- Multi-hop declarative data pipelines☆115Updated this week
- In-Memory Analytics for Kafka using DuckDB☆122Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆77Updated last month
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 5 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Storage connector for Trino☆110Updated 3 weeks ago
- Java binding to Apache DataFusion☆81Updated last month
- ☆84Updated last week
- ☆105Updated last year
- Apache Calcite Adapter for Apache Kudu☆28Updated 7 months ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated this week
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.☆25Updated 5 months ago
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆41Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.☆224Updated 2 months ago
- Fork of Apache Kafka implenting KIP-1150 -- Diskless Topics☆25Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆89Updated 3 weeks ago
- ☆80Updated last month
- BigQuery connector for Apache Flink☆31Updated 2 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆259Updated last week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆53Updated last week
- Apache Hive Metastore as a Standalone server in Docker☆75Updated 9 months ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- Snowflake connector repository for the Apache Flink project☆37Updated last month
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆65Updated 2 months ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆18Updated 3 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated 5 months ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆40Updated 8 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Apache Iceberg Documentation Site☆42Updated last year