tobilg / serverless-parquet-repartitioner
Lambda function to serverlessly repartition parquet files in S3
☆36 Updated 3 months ago
Alternatives and similar repositories for serverless-parquet-repartitioner
Users who are interested in serverless-parquet-repartitioner are comparing it to the libraries listed below.
- ☆52 Updated this week
- An example of how to run DuckDB on AWS Lambda & API Gateway. ☆156 Updated last month
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 Updated last year
- Time series forecasting with DuckDB and Evidence ☆41 Updated 8 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B… ☆37 Updated 3 weeks ago
- DuckDB Cron Expression Extension ☆25 Updated last year
- Packaging DuckDB for Node.js Lambda functions. Example application: https://github.com/tobilg/serverless-duckdb ☆120 Updated 3 weeks ago
- Public issue-tracking and feature suggestion for sql-workbench.com ☆51 Updated last year
- Inspect Your Servers with DuckDB ☆30 Updated 2 months ago
- BoilingData JS client (NodeJS and Browsers) ☆19 Updated 9 months ago
- DuckDB WebMacro: Share and Load your SQL Macros via gists ☆12 Updated 6 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆185 Updated last week
- DuckDB extension allowing shell commands to be used for input and output. ☆75 Updated last month
- Running DuckDB behind a Hono.js API ☆82 Updated 3 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆96 Updated this week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆91 Updated 4 months ago
- ERPL is a DuckDB extension to integrate Enterprise Data in your Data Science and ML pipelines within minutes! ERPL connects DuckDB to SAP… ☆43 Updated 3 weeks ago
- Extension for DuckDB for functions that require the Apache Arrow dependency ☆43 Updated 2 months ago
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC ☆148 Updated last month
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆83 Updated 4 months ago
- A curated list of awesome SQLMesh resources ☆36 Updated 2 months ago
- This repository is made as read-only filesystem for remote access. ☆81 Updated 3 weeks ago
- Python package for querying iceberg data through duckdb. ☆70 Updated last year
- ☆146 Updated 2 months ago
- ☆90 Updated last year
- A playground for running duckdb as a stateless query engine over a data lake ☆209 Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆55 Updated this week
- A serverless duckDB deployment at GCP ☆39 Updated 2 years ago
- High throughput streaming of Protobuf data from Kafka into DuckDB ☆11 Updated last week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆206 Updated 3 weeks ago