basin-etl / basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser.
☆35 Updated last year
Related projects:
- Delta reader for the Ray open-source toolkit for building ML applications ☆40 Updated 7 months ago
- Demos of Materialize, the operational data warehouse. ☆50 Updated 2 weeks ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆91 Updated this week
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 3 years ago
- A curated list of dagster code snippets for data engineers ☆48 Updated 6 months ago
- Apache Hive Metastore as a Standalone server in Docker ☆64 Updated 3 weeks ago
- ☆22 Updated 2 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆96 Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆64 Updated 3 years ago
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆60 Updated this week
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆79 Updated last month
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆102 Updated this week
- DB API 2 interface for Flight SQL with SQLAlchemy extras. ☆31 Updated 5 months ago
- DuckDB for streaming data ☆62 Updated 5 months ago
- Unity Catalog UI ☆40 Updated 2 weeks ago
- Data Lineage Tracing Library ☆21 Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆57 Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆68 Updated last week
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi… ☆40 Updated 2 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆61 Updated last year
- BoilingData JS client (NodeJS and Browsers) ☆19 Updated last week
- chDB AWS Lambda container ☆15 Updated last year
- Where the Meltano team runs Meltano! Get it??? ☆25 Updated last month
- Use SQL to build ELT pipelines on a data lakehouse. ☆285 Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆60 Updated last year
- ☆10 Updated last year
- ☆26 Updated last year
- a collection of resources and blogs about Apache Superset ☆78 Updated 2 years ago
- Amundsen Gremlin ☆20 Updated 2 years ago