airbytehq / airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
☆18,152Updated this week
Alternatives and similar repositories for airbyte
Users that are interested in airbyte are comparing it to the libraries listed below
Sorting:
- Self-serve BI to 10x your data team ⚡️☆4,712Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆19,290Updated this week
- Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeli…☆4,293Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,057Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,827Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆13,121Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,569Updated 3 weeks ago
- The Metadata Platform for your Data and AI Stack☆10,606Updated this week
- Privacy and Security focused Segment-alternative, in Golang and React☆4,183Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,023Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,329Updated this week
- Compare tables within or across databases☆2,969Updated last year
- Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x…☆13,004Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆11,287Updated this week
- Broadcast, Presence, and Postgres Changes via WebSockets☆7,091Updated this week
- 🔎 Open source distributed and RESTful search engine.☆10,706Updated this week
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep…☆6,682Updated this week
- Apache Iceberg☆7,359Updated this week
- This repository is a getting started guide to Singer.☆1,300Updated 8 months ago
- A curated list of awesome open source workflow engines☆7,039Updated 2 weeks ago
- Upserts, Deletes And Incremental Processing on Big Data.☆5,769Updated this week
- SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.☆8,497Updated this week
- Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.☆2,100Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆40,052Updated this week
- Open Source Feature Flagging and A/B Testing Platform☆6,559Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,317Updated this week
- Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 600+ plugins. Alternative to Airflow, n8n, Rund…☆17,830Updated this week
- 📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics☆18,491Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,301Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,562Updated last year