erezsh / reladiff
High-performance diffing of large datasets across databases
☆388 · Updated 2 weeks ago
Alternatives and similar repositories for reladiff:
Users who are interested in reladiff are comparing it to the libraries listed below.
- A Postgres Proxy Server in Python ☆266 · Updated last month
- PRQL as a DuckDB extension ☆273 · Updated 4 months ago
- DuckDB for streaming data ☆309 · Updated this week
- A Python framework for defining and querying BI models in your data warehouse ☆163 · Updated 2 weeks ago
- 🏃‍♀️ Minimalist alternative to dbt ☆232 · Updated this week
- Turning PySpark Into a Universal DataFrame API ☆354 · Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆106 · Updated 2 weeks ago
- Work with your web service, database, and streaming schemas in a single format. ☆337 · Updated 10 months ago
- Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM ☆314 · Updated this week
- Dagster Labs' open-source data platform, built with Dagster. ☆304 · Updated this week
- The smallest DuckDB SQL orchestrator on Earth. ☆199 · Updated this week
- Fast SQL formatter/linter ☆634 · Updated this week
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database. ☆489 · Updated this week
- Stream Arrow data into Postgres ☆256 · Updated 9 months ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads. ☆623 · Updated this week
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC ☆138 · Updated this week
- Incremental Data Processing in PostgreSQL ☆151 · Updated 2 weeks ago
- DuckDB-powered data lake analytics from Postgres ☆466 · Updated this week
- DuckDB HTTP API Server and Query Interface in a Community Extension ☆148 · Updated last month
- DuckDB extension for Delta Lake ☆153 · Updated this week
- A playground for running duckdb as a stateless query engine over a data lake ☆184 · Updated last year
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆 ☆146 · Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆212 · Updated last week
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆99 · Updated this week
- PyAirbyte brings the power of Airbyte to every Python developer. ☆244 · Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆316 · Updated last year
- Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows. ☆819 · Updated this week
- GlareDB: An analytics DBMS for distributed data ☆758 · Updated this week
- Columnstore Table in Postgres ☆468 · Updated this week
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally. ☆113 · Updated 3 weeks ago