dataforgelabs / dataforge-coreLinks
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
☆49Updated 3 weeks ago
Alternatives and similar repositories for dataforge-core
Users that are interested in dataforge-core are comparing it to the libraries listed below
Sorting:
- Boiling Insights - From raw S3 data to charts in seconds☆18Updated 5 months ago
- BoilingData JS client (NodeJS and Browsers)☆19Updated 8 months ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆53Updated last week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆79Updated 3 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆97Updated this week
- A curated list of dagster code snippets for data engineers☆55Updated last year
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com☆53Updated this week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆55Updated 3 months ago
- ☆39Updated 3 weeks ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆28Updated last week
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆183Updated last week
- ☆90Updated last year
- duckdb-etl-framework☆11Updated 5 months ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆15Updated 7 months ago
- Unity Catalog UI☆40Updated 8 months ago
- The Modern Data Stack in a Python package☆49Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated 11 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- ☆14Updated last year
- GizmoSQL Public repo - used for README purposes and to make artifacts available for public download☆25Updated 2 weeks ago
- scraping and querying documents for LLMs☆21Updated this week
- API for distributing Data Lake Data☆11Updated 2 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆27Updated 7 months ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆82Updated 2 months ago
- Automate and streamline the alerting & notification process for dbt test results🐞🚀☆17Updated last month
- Chatbot for BI☆37Updated 2 years ago
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 4 years ago
- Next generation compute platform for the post-modern data stack☆15Updated this week
- Ibis analytics, with Ibis (and more!)☆22Updated 8 months ago