daefresh / awesome-data-temporalityLinks
A curated list to help you manage temporal data across many modalities π.
β118Updated 3 years ago
Alternatives and similar repositories for awesome-data-temporality
Users that are interested in awesome-data-temporality are comparing it to the libraries listed below
Sorting:
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.β171Updated 2 years ago
- An in-process Parquet merge engine for better data warehousing in S3 with MVCCβ152Updated 8 months ago
- ROAPI user documentationβ56Updated 7 months ago
- sgr (command line client for Splitgraph) and the splitgraph Python libraryβ323Updated last year
- Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.β84Updated last year
- Lambda function to serverlessly repartition parquet files in S3β38Updated 10 months ago
- SQL Reimagined for the Modern Data Worldβ57Updated this week
- Query processor with proven optimizations, ready to use for your JSON store to query semi-structured data with JSONiq. Can also be used aβ¦β49Updated last week
- In-Memory Analytics for Kafka using DuckDBβ147Updated this week
- Taxi is a language for describing APIs, data models, and how everything relatesβ189Updated last week
- Public issue-tracking and feature suggestion for sql-workbench.comβ57Updated last year
- High-performance diffing of large datasets across databasesβ513Updated 5 months ago
- list and get specific files from remote zip archives without downloading the whole thingβ157Updated last year
- Pushdown compute from Snowflake to DuckDB running on your infrastructureβ204Updated 3 months ago
- DuckDB extension allowing shell commands to be used for input and output.β93Updated last week
- Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functionβ¦β93Updated 4 years ago
- Simple SQL finite state machine for Postgresβ72Updated 8 months ago
- Serverless multi-protocol + multi-destination event collection system.β210Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data typeβ¦β66Updated 2 weeks ago
- Data pipelines from re-usable componentsβ107Updated 2 months ago
- Singer.io Tap for PostgreSQL - PipelineWise compatibleβ40Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.β169Updated 4 months ago
- Multi-hop declarative data pipelinesβ124Updated this week
- Work with your web service, database, and streaming schemas in a single format.β350Updated last month
- Incremental Data Processing in PostgreSQLβ219Updated last month
- CLI for running Airbyte sources & destinations locally without Airbyte serverβ34Updated this week
- This repo contains information about DuckDB extensions found on GitHub. Refreshed dailyβ108Updated this week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- A playground for running duckdb as a stateless query engine over a data lakeβ218Updated 2 years ago
- FUSE-based DuckDB file system π¦β49Updated 7 months ago