dataforgelabs / dataforge-core
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
☆59 · Updated this week
Alternatives and similar repositories for dataforge-core
Users who are interested in dataforge-core are comparing it to the libraries listed below.
- Managed isolated environments for Python ☆107 · Updated 2 weeks ago
- BoilingData JS client (NodeJS and Browsers) ☆19 · Updated last year
- scraping and querying documents for LLMs ☆24 · Updated 3 months ago
- DuckDB Community Extension to prompt LLMs from SQL ☆53 · Updated last month
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications ☆106 · Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 · Updated 3 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 · Updated 2 years ago
- ✨ Build dashboards with end-to-end version control. CLI w/ batteries included, no infra required. Develop on your laptop for instant r… ☆89 · Updated this week
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com ☆60 · Updated this week
- A monorepo of many Rill example projects ☆47 · Updated this week
- Chrome Extension for exploring Hugging Face datasets ☆48 · Updated last year
- Simple Workflow Framework - Hamilton + Task Queue (RQ or APScheduler) = FlowerPower ☆22 · Updated last month
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B… ☆64 · Updated last month
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆108 · Updated this week
- Discover the simplicity and strength of DuckDB, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution … ☆39 · Updated 2 years ago
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s… ☆35 · Updated last month
- Time series forecasting with DuckDB and Evidence ☆43 · Updated last year
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks. ☆47 · Updated 4 months ago
- Next generation compute platform for the post-modern data stack ☆20 · Updated this week
- Create and manage data pipes with Meerschaum. ☆153 · Updated 2 weeks ago
- ☆14 · Updated 4 months ago
- tsellm: LLMs in SQLite and DuckDB ☆25 · Updated 8 months ago
- Data management with LLMs ☆180 · Updated 11 months ago
- Slipstream provides a data-flow model to simplify development of stateful streaming applications. ☆38 · Updated 3 months ago
- In-browser data analysis using SQL | Powered by duckdb-wasm ☆26 · Updated 3 weeks ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not… ☆41 · Updated last year
- A DuckDB extension for graph data analytics ☆91 · Updated last week
- Geniusrise: Framework for building geniuses ☆61 · Updated last month
- A Python framework for defining and querying BI models in your data warehouse ☆169 · Updated 11 months ago
- A playground for running DuckDB as a stateless query engine over a data lake ☆216 · Updated 2 years ago