ClusterlessHQ / clusterless
Clusterless is a tool for scheduling decentralized, scalable, and secure data pipelines for continuously arriving data, across clouds.
☆12Updated 2 weeks ago
Alternatives and similar repositories for clusterless:
Users that are interested in clusterless are comparing it to the libraries listed below
- Extension for DuckDB for functions that require the Apache Arrow dependency☆39Updated 2 months ago
- An open-source, community-driven REST catalog for Apache Iceberg!☆26Updated 9 months ago
- ☆36Updated 2 weeks ago
- Inspect Your Servers with DuckDB☆30Updated 2 years ago
- A Benchmark for Real-Time Analytics Applications☆14Updated this week
- ☆25Updated last week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆47Updated last week
- Idempotent query executor☆51Updated 3 weeks ago
- Apache Parquet Testing☆56Updated this week
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆40Updated 2 weeks ago
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Apache Arrow PostgreSQL connector☆58Updated last year
- ☆105Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆32Updated 2 years ago
- sql-logic-test☆60Updated last year
- DuckDB Extension Linearization/Delinearization, Z-Order, Hilbert and Morton Curves☆41Updated last week
- In-Memory Analytics for Kafka using DuckDB☆110Updated this week
- Multi-hop declarative data pipelines☆112Updated last week
- DuckDB is an in-process SQL OLAP Database Management System☆42Updated last week
- Apache Arrow Ballista Python bindings☆37Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server.☆89Updated 2 weeks ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- ☆32Updated last year
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆16Updated 3 months ago
- Standard ML interpreter, with relational extensions, implemented in Java☆33Updated this week
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.☆102Updated this week
- Bridge between Go and Python to facilitate zero-copy using Apache Arrow☆19Updated 5 years ago
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆41Updated 6 months ago
- Data Catalog for Databases and Data Warehouses☆33Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 4 years ago