Quickstart for any service
☆167Mar 3, 2026Updated this week
Alternatives and similar repositories for insta-infra
Users that are interested in insta-infra are comparing it to the libraries listed below
Sorting:
- Compare different technologies. No BS and all sources linked.☆14May 4, 2024Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆73Feb 14, 2026Updated 3 weeks ago
- ☆16Nov 27, 2025Updated 3 months ago
- An example repository showing how to leverage Kafka to stream your data☆21May 11, 2024Updated last year
- Create data pipeline with sqlmesh orchestrated by dagster☆28Oct 27, 2025Updated 4 months ago
- FInal project for data zoom camp 2024☆16Mar 31, 2024Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- High throughput streaming of Protobuf data from Kafka into DuckDB☆12Updated this week
- Creating a REST API with Python on Synapse Serverless pools using external tables☆12Dec 27, 2021Updated 4 years ago
- A Kanban CLI tool. Create boards, add cards to them or remove them. Built in Python using Typer, it stores cards as JSON data, trivially…☆15Apr 3, 2025Updated 11 months ago
- ☆12Feb 23, 2022Updated 4 years ago
- Arrow-Powered Data Exchange☆15Feb 7, 2025Updated last year
- ☆12Jun 25, 2024Updated last year
- A helm chart for Prefect☆14Jun 4, 2020Updated 5 years ago
- Protobuf to Arrow, using Rust☆24Updated this week
- A repository for the Machine Learning Engineering for Production Specialization provided by Deeplearning.ai .☆12Aug 5, 2021Updated 4 years ago
- Community supported integrations for the Dagster platform.☆48Feb 11, 2026Updated 3 weeks ago
- ☆15Mar 10, 2024Updated last year
- Go library for decoding generic map values and native Go structures into Arrow.☆17Jan 30, 2026Updated last month
- ☆16Apr 8, 2025Updated 11 months ago
- Prefect integration with OpenMetadata☆36Apr 25, 2024Updated last year
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆4,980Updated this week
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Nov 16, 2021Updated 4 years ago
- Support code for Indicium Engineering blog series Dagster Power User.☆19Jul 19, 2024Updated last year
- ☆19May 11, 2025Updated 9 months ago
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 6 months ago
- ☆23Jan 25, 2023Updated 3 years ago
- Core system of Emitbase☆27Oct 23, 2023Updated 2 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Dec 9, 2023Updated 2 years ago
- Simplistic and minimalist storage.☆24May 11, 2025Updated 9 months ago
- Iceberg Playground in a Box☆67Jun 27, 2025Updated 8 months ago
- ☆21Aug 16, 2024Updated last year
- prefect integration for running dbt☆64Sep 3, 2024Updated last year
- ☆21Aug 8, 2024Updated last year
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.☆353Sep 30, 2025Updated 5 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset☆54Dec 13, 2025Updated 2 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆164Jul 17, 2024Updated last year
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆120Mar 5, 2025Updated last year