dataforgelabs / dataforge-coreLinks
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
β55Updated 2 weeks ago
Alternatives and similar repositories for dataforge-core
Users that are interested in dataforge-core are comparing it to the libraries listed below
Sorting:
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.comβ59Updated this week
- π‘οΈ Managed isolated environments for Pythonβ105Updated last week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- BoilingData JS client (NodeJS and Browsers)β19Updated last year
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualitβ¦β65Updated last month
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applicationsβ104Updated last year
- DuckDB Community Extension to prompt LLMs from SQLβ51Updated 2 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of Bβ¦β58Updated last week
- Next generation compute platform for the post-modern data stackβ20Updated last week
- A monorepo of many Rill example projectsβ45Updated last week
- A curated list of dagster code snippets for data engineersβ56Updated last year
- Time series forecasting with DuckDB and Evidenceβ42Updated last year
- β¨ Build dashboards with end-to-end version control. π CLI w/ batteries included, no infra required. Develop on your laptop for instant rβ¦β85Updated last week
- A playground for running duckdb as a stateless query engine over a data lakeβ214Updated last year
- Contribute to dlt verified sources π₯β101Updated this week
- Repo for orienting dbt users to the Dagster asset frameworkβ56Updated 3 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructureβ195Updated last month
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB workerβ¦β18Updated last year
- scraping and querying documents for LLMsβ24Updated last month
- β14Updated 3 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed dailyβ105Updated this week
- The open source metrics layerβ43Updated last week
- A Python framework for defining and querying BI models in your data warehouseβ169Updated 10 months ago
- Create and manage data pipes with Meerschaum.β153Updated this week
- β90Updated last year
- BuildFlow, is an open source framework for building large scale systems using Python. All you need to do is describe where your input is β¦β198Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈβ18Updated 3 weeks ago
- β67Updated last year
- Simple Workflow Framework - Hamilton + Task Queue (RQ or APScheduler) = FlowerPowerβ22Updated 2 weeks ago
- Tableau Connector for DuckDBβ11Updated last year