dataforgelabs / dataforge-coreLinks
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
☆50Updated last week
Alternatives and similar repositories for dataforge-core
Users that are interested in dataforge-core are comparing it to the libraries listed below
Sorting:
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- 🛡️ Managed isolated environments for Python☆94Updated 3 weeks ago
- Time series forecasting with DuckDB and Evidence☆41Updated 8 months ago
- A Higher-Level, Composable SQL☆45Updated this week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆60Updated last week
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆37Updated 3 weeks ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not…☆39Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆55Updated last week
- BoilingData JS client (NodeJS and Browsers)☆19Updated 9 months ago
- ☆52Updated this week
- pip installable duckdb extensions published to pypi☆27Updated last week
- scraping and querying documents for LLMs☆23Updated last month
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com☆55Updated last week
- ☆90Updated last year
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆34Updated last year
- Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model☆67Updated 3 weeks ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆98Updated 9 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆83Updated 4 months ago
- ☆37Updated 2 weeks ago
- S3 vector database for LLM Agents and RAG.☆43Updated last year
- Heimdall is a data orchestration and job execution platform☆56Updated last week
- DuckDB Community Extension to prompt LLMs from SQL☆49Updated 6 months ago
- ☆14Updated last year
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆96Updated last week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆38Updated 2 months ago
- NetworkX-like Python experience for Postgres, SQLite, MongoDB, and Neo4J☆23Updated 4 months ago
- DuckDB API integrations☆34Updated 5 months ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- A serverless duckDB deployment at GCP☆40Updated 2 years ago