daefresh / awesome-data-temporality
A curated list to help you manage temporal data across many modalities 🚀.
☆110Updated 2 years ago
Alternatives and similar repositories for awesome-data-temporality:
Users that are interested in awesome-data-temporality are comparing it to the libraries listed below
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆141Updated 2 months ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- A Higher-Level, Composable SQL☆43Updated this week
- ROAPI user documentation☆54Updated last month
- Apache Hive Metastore in Standalone Mode With Docker☆11Updated 8 months ago
- CLI to create an ER Diagram from DuckDB database files☆119Updated 3 weeks ago
- A Benchmark for Real-Time Analytics Applications☆35Updated last week
- Create and manage data pipes with Meerschaum.☆134Updated last week
- Kuvasz-Streamer is a Postgres-to-Postgres data consolidation and change data capture project.☆133Updated 2 months ago
- sgr (command line client for Splitgraph) and the splitgraph Python library☆322Updated 11 months ago
- Public issue-tracking and feature suggestion for sql-workbench.com☆43Updated 9 months ago
- Vector Arithmetic and Weighted, Variably Randomized Cosine Similarity Search in Postgres☆44Updated 4 years ago
- Postgres extension that speeds up analytics queries by upto 90%☆50Updated 9 months ago
- Data pipelines from re-usable components☆108Updated 2 years ago
- Incremental Data Processing in PostgreSQL☆175Updated last month
- Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.☆81Updated 4 months ago
- Data Mesh Architecture☆74Updated 8 months ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆47Updated last week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆13Updated last year
- Scale to zero Seafowl hosting with Cloud Run☆38Updated last year
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆193Updated last week
- Multi-hop declarative data pipelines☆112Updated last week
- Schema Registry Statistics Tool☆24Updated this week
- Lambda function to serverlessly repartition parquet files in S3☆35Updated this week
- ☆34Updated last year
- CLI for running Airbyte sources & destinations locally without Airbyte server☆32Updated this week
- A curated this list of briefing questions that need to be cleared to lay out a game plan to work on a client's infrastructure, migrate ap…☆77Updated 2 years ago
- Data Tools Subjective List☆83Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆157Updated 4 months ago