level-vc / useful
The open-source Useful SDK. A single Python decorator from the Useful library provides full observability of Python functions within an ETL pipeline.
☆20 Updated last year
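As a rough illustration of the decorator-based approach described above, the following is a minimal sketch of how a single decorator can record the outcome and duration of each function call. The name `observe` and the `records` attribute are hypothetical and are not taken from the Useful SDK; its actual API may look quite different.

```python
import functools
import time


def observe(func):
    """Hypothetical decorator (not the Useful SDK's API): records one entry per call."""
    records = []

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        status = "error"  # overwritten only if the call returns normally
        try:
            result = func(*args, **kwargs)
            status = "ok"
            return result
        finally:
            # Runs on both success and failure, so every call is recorded.
            records.append({
                "function": func.__name__,
                "status": status,
                "seconds": time.perf_counter() - start,
            })

    wrapper.records = records  # expose the collected entries for inspection
    return wrapper


@observe
def transform(rows):
    """A stand-in ETL step that doubles each value."""
    return [r * 2 for r in rows]


transform([1, 2, 3])
```

After the call, `transform.records` holds one dictionary with the function name, status, and elapsed time. A real observability tool would presumably ship these records to a backend rather than keep them in memory.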
Alternatives and similar repositories for useful
Users who are interested in useful are comparing it to the libraries listed below.
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆24 Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 Updated last year
- Python binding for DataFusion ☆59 Updated 2 years ago
- A library to use `modal` as a backend for `joblib`. ☆28 Updated 3 months ago
- Ibis analytics, with Ibis (and more!) ☆21 Updated 7 months ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆33 Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆108 Updated 4 months ago
- A conda-smithy repository for python-duckdb. ☆13 Updated 3 weeks ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆77 Updated 2 months ago
- Arrow, pydantic style ☆82 Updated 2 years ago
- A Python library bakeoff for medium-sized datasets ☆24 Updated last year
- ☆21 Updated 8 months ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated last year
- A repository of runnable examples using ibis ☆43 Updated 10 months ago
- A monorepo of many Rill example projects ☆36 Updated last week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications. ☆36 Updated 3 weeks ago
- Dask integration for Snowflake ☆30 Updated 5 months ago
- scraping and querying documents for LLMs ☆20 Updated this week
- Apache Arrow Development Experiments ☆20 Updated 3 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S… ☆17 Updated 9 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆97 Updated this week
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆29 Updated 5 months ago
- ☆42 Updated last week
- Distributed persistent Task Queue running on Dask ☆38 Updated 2 years ago
- Native polars deltalake reader ☆9 Updated 8 months ago
- ☆90 Updated last year
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- Unified Distributed Execution ☆52 Updated 6 months ago
- A collection of self-contained fsspec-based filesystems ☆16 Updated last week
- Example project for building scalable data pipelines with Kedro and Ibis. ☆13 Updated last year