daefresh / awesome-data-temporality
A curated list to help you manage temporal data across many modalities 🚀.
☆104 · Updated last year
Related projects:
- Generate authentic-looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆141 · Updated 2 weeks ago
- A tool to automatically infer column data types in .csv files ☆33 · Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 · Updated 2 years ago
- Analyzing Hacker News in real time with Bytewax and Proton ☆39 · Updated 7 months ago
- ROAPI user documentation ☆52 · Updated 8 months ago
- DuckDB extension allowing shell commands to be used for input and output. ☆48 · Updated last week
- Data Mesh Architecture ☆70 · Updated 2 months ago
- ☆31 · Updated 6 months ago
- An in-process Parquet merge engine for better data warehousing in S3 ☆125 · Updated last month
- Query Snowflake tables locally with DuckDB, without any need for a running warehouse ☆62 · Updated 3 weeks ago
- ☆160 · Updated this week
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 · Updated 7 months ago
- SyncLite: Build Anything Sync Anywhere ☆132 · Updated this week
- Create and manage data pipes with Meerschaum. ☆129 · Updated this week
- A Python framework for defining and querying BI models in your data warehouse ☆157 · Updated 5 months ago
- High-performance diffing of large datasets across databases ☆338 · Updated last week
- ☆18 · Updated last year
- ☆116 · Updated last year
- ☆39 · Updated last month
- Lambda function to serverlessly repartition Parquet files in S3 ☆26 · Updated 11 months ago
- CLI to create an ER Diagram from DuckDB database files ☆59 · Updated last week
- Singer.io Tap for PostgreSQL - PipelineWise compatible ☆41 · Updated 2 weeks ago
- A toolkit for statistical process control using SQL ☆56 · Updated this week
- A browser-based Parquet file viewer ☆38 · Updated 3 weeks ago
- Work with your web service, database, and streaming schemas in a single format. ☆323 · Updated 5 months ago
- Postgres extension that speeds up analytics queries by up to 90% ☆47 · Updated 3 months ago
- CLI for running Airbyte sources & destinations locally without Airbyte server ☆31 · Updated 3 weeks ago
- Scale-to-zero Seafowl hosting with Cloud Run ☆39 · Updated last year
- In-Memory Analytics for Kafka using DuckDB ☆63 · Updated this week
- Test data management tool for any data source, batch or real-time ☆35 · Updated last week