bruin-data / bruinLinks
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
☆969Updated this week
Alternatives and similar repositories for bruin
Users that are interested in bruin are comparing it to the libraries listed below
Sorting:
- Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipeli…☆639Updated this week
- DuckDB for streaming data☆612Updated last month
- Visual Data Preparation and Transformation. Low-Code Python-based ETL.☆1,091Updated 2 weeks ago
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆467Updated this week
- Stream, transform, and route PostgreSQL data in real-time.☆786Updated this week
- PgQueuer is a Python library leveraging PostgreSQL for efficient job queuing.☆1,340Updated this week
- Durable workflow automation in just a few lines of code☆901Updated this week
- High-performance diffing of large datasets across databases☆473Updated this week
- Open-source Snowflake and Fivetran alternative bundled together☆1,411Updated last week
- Scratch is a swiss army knife for big data.☆1,115Updated last year
- Lightweight Durable Python Workflows☆865Updated this week
- A Python framework for defining and querying BI models in your data warehouse☆169Updated 7 months ago
- A series of top performing Text to SQL LLMs☆866Updated last year
- Open-source BI for engineers☆2,325Updated this week
- Buckaroo - The data table UI for Notebooks. Quickly explore dataframes, scroll through dataframes, search, sort, view summary stats and…☆626Updated 2 weeks ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆667Updated this week
- The developer framework for building analytical backends on top of ClickHouse, Redpanda and other high-performance analytical infrastruct…☆334Updated this week
- WhatTheDuck is an open-source web application built on DuckDB. It allows users to upload CSV and Parquet files, store them in tables, and…☆628Updated 2 months ago
- Open source auth infrastructure for B2B SaaS☆899Updated this week
- ingestr is a CLI tool to copy data between any databases with a single command seamlessly.☆3,205Updated this week
- PostgreSQL database anonymization and synthetic data generation tool☆1,519Updated this week
- A project providing a Graphic Walker Pane for use with HoloViz Panel.☆323Updated 5 months ago
- Web-based log viewer UI. Explore logs data stored in ClickHouse or Docker☆598Updated last month
- AI agent expert in PostgreSQL☆836Updated this week
- The simplest way to build AI workloads on Postgres☆806Updated last week
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.☆654Updated last week
- The Data Change Processing platform☆1,161Updated last week
- Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. …☆415Updated 3 weeks ago
- High Performace IDE for Jupyter Notebooks☆2,212Updated this week
- Metrics Observability & Troubleshooting☆322Updated last year