dataforgelabs / dataforge-coreLinks
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
β59Updated 2 weeks ago
Alternatives and similar repositories for dataforge-core
Users that are interested in dataforge-core are comparing it to the libraries listed below
Sorting:
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applicationsβ106Updated last year
- π‘οΈ Managed isolated environments for Pythonβ109Updated last week
- A monorepo of many Rill example projectsβ47Updated 2 weeks ago
- BoilingData JS client (NodeJS and Browsers)β19Updated last year
- The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.comβ60Updated this week
- Open-source repository for Semantic Modeling Language (SML)β136Updated 2 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of Bβ¦β65Updated last week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- DuckDB Community Extension to prompt LLMs from SQLβ55Updated this week
- Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈβ18Updated 2 weeks ago
- scraping and querying documents for LLMsβ24Updated 4 months ago
- Simple Workflow Framework - Hamilton + Task Queue (RQ or APScheduler) = FlowerPowerβ23Updated 2 months ago
- Next generation compute platform for the post-modern data stackβ22Updated this week
- β14Updated 5 months ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.β73Updated 2 weeks ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β66Updated last week
- Serverless for data practitioners. The fastest β‘οΈ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter notβ¦β41Updated last year
- Geniusrise: Framework for building geniusesβ61Updated 2 months ago
- Time series forecasting with DuckDB and Evidenceβ43Updated last year
- Chrome Extension for exploring Hugging Face datasets πβ48Updated last year
- Guide for running a custom API Powered by Snowflake in Pythonβ21Updated 8 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructureβ203Updated 3 months ago
- A curated list of dagster code snippets for data engineersβ56Updated last year
- Data management with LLMsβ182Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB workerβ¦β18Updated 2 years ago
- Contribute to dlt verified sources π₯β104Updated last month
- β¨ Build dashboards with end-to-end version control. π CLI w/ batteries included, no infra required. Develop on your laptop for instant rβ¦β92Updated this week
- Unity Catalog UIβ43Updated last year
- Python+VueJS application to load, explore, combine,transform and deliver dataβ102Updated 11 months ago
- portable Python ML-powered data botβ25Updated last year