erezsh / reladiffLinks
High-performance diffing of large datasets across databases
☆450Updated last month
Alternatives and similar repositories for reladiff
Users that are interested in reladiff are comparing it to the libraries listed below
Sorting:
- DuckDB for streaming data☆585Updated 2 weeks ago
- A Postgres Proxy Server in Python☆295Updated 7 months ago
- 🏃♀️ Minimalist SQL orchestrator☆255Updated this week
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.☆617Updated this week
- PRQL as a DuckDB extension☆292Updated 3 months ago
- Turning PySpark Into a Universal DataFrame API☆413Updated this week
- DuckDB-powered data lake analytics from Postgres☆522Updated 3 months ago
- The smallest DuckDB SQL orchestrator on Earth.☆315Updated 2 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆185Updated last week
- Stream Arrow data into Postgres☆266Updated 3 weeks ago
- Work with your web service, database, and streaming schemas in a single format.☆344Updated 3 weeks ago
- A Python framework for defining and querying BI models in your data warehouse☆166Updated 5 months ago
- Run, mock and test fake Snowflake databases locally.☆144Updated last week
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆264Updated this week
- The Control Plane for Apache Iceberg.☆278Updated this week
- GigAPI is a Timeseries lakehouse for real-time data and sub-second queries, powered by DuckDB OLAP + Parquet Query Engine, Compactor w/ C…☆283Updated last week
- Catalog, compose, and ship ML—Python simplicity, SQL scale.☆305Updated this week
- Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM☆418Updated this week
- DuckDB extension for Delta Lake☆193Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆329Updated 2 years ago
- Metrics Observability & Troubleshooting☆321Updated last year
- Incremental Data Processing in PostgreSQL☆192Updated last month
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆148Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆122Updated 5 months ago
- Serverless multi-protocol + multi-destination event collection system.☆207Updated 7 months ago
- Python bindings for sqlparser-rs☆191Updated last month
- Quickstart for any service☆155Updated this week
- DuckDB HTTP API Server and Query Interface in a Community Extension☆211Updated last week
- A playground for running duckdb as a stateless query engine over a data lake☆209Updated last year
- Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.☆955Updated this week