airbytehq / airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
☆17,449Updated this week
Alternatives and similar repositories for airbyte:
Users that are interested in airbyte are comparing it to the libraries listed below
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,481Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆1,980Updated this week
- The Metadata Platform for your Data and AI Stack☆10,383Updated this week
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆4,909Updated this week
- Self-serve BI to 10x your data team ⚡️☆4,512Updated this week
- lakeFS - Data version control for your data lake | Git for data☆4,566Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆41,129Updated this week
- Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeli…☆4,232Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,519Updated this week
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x…☆12,430Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆27,038Updated this week
- Compare tables within or across databases☆2,962Updated 9 months ago
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆10,959Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,565Updated 10 months ago
- SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-so…☆20,887Updated this week
- Postgres with GPUs for ML/AI apps.☆6,184Updated 2 weeks ago
- Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.☆2,051Updated this week
- Build data pipelines, the easy way 🛠️☆4,113Updated last year
- Apache Iceberg☆6,996Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,031Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,182Updated this week
- 📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics☆18,309Updated this week
- Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!☆10,081Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆18,553Updated this week
- Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.☆11,150Updated this week
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep…☆6,229Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆8,645Updated this week
- Apache Pinot - A realtime distributed OLAP datastore☆5,660Updated this week
- The official repository is hosted on https://gitlab.com/bramw/baserow. Baserow is an open source no-code database tool and Airtable alter…☆2,558Updated this week
- Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, C…☆16,415Updated this week