A Rust based data/CSV/Parquet file generator
☆66Mar 3, 2025Updated last year
Alternatives and similar repositories for datahobbit
Users that are interested in datahobbit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated 2 years ago
- Proof-of-concept extension combining the delta extension with Unity Catalog☆100Apr 23, 2026Updated last week
- ☆84Updated this week
- dbt starter code for enterprise Snowflake usage data artifacts☆21Sep 7, 2022Updated 3 years ago
- TPC-H benchmark data generation in pure Rust☆240Apr 20, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Nov 22, 2024Updated last year
- The Data Product Specification☆11Jan 28, 2025Updated last year
- csv and flat-file sniffer built in Rust.☆45Jan 26, 2024Updated 2 years ago
- SQLMesh example projects☆41Jul 2, 2025Updated 10 months ago
- Using DuckDB with AWS Lambda to process Delta Lake data☆33Jan 26, 2025Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- SQL Benchmark derived from TPC-H☆11May 20, 2023Updated 2 years ago
- Build your own S3-Select in 400 lines of Rust!☆14Mar 23, 2025Updated last year
- Lambda function to serverlessly repartition parquet files in S3☆39Mar 30, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- SQL Benchmark derived from TPC-DS☆15May 20, 2023Updated 2 years ago
- ☆12Oct 24, 2025Updated 6 months ago
- a pytest plugin for dbt adapter test suites☆19Oct 31, 2023Updated 2 years ago
- A rust implemention based on `How Query Engines Work`☆15Sep 2, 2024Updated last year
- A platform to manage the data product life cycle☆22Mar 25, 2026Updated last month
- Model drift detection☆11Jul 22, 2023Updated 2 years ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆127Jan 21, 2025Updated last year
- Arrow Flight SQL Server for DuckDB☆143Mar 31, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆331Updated this week
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Dec 9, 2024Updated last year
- DataFusion FlightSQL Server☆29Apr 13, 2026Updated 2 weeks ago
- DuckDB WebMacro: Share and Load your SQL Macros via gists☆15Mar 24, 2026Updated last month
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆235Apr 20, 2026Updated last week
- DuckDB CronJob Extension☆48Mar 29, 2026Updated last month
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆202Oct 20, 2025Updated 6 months ago
- Protobuf to Arrow, using Rust☆25Apr 17, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Learn about Spice.ai with in-depth samples☆19Dec 19, 2024Updated last year
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆113Updated this week
- An experimental Athena extension for DuckDB 🐤☆57Dec 31, 2024Updated last year
- Python pipeline utility library☆18Jul 25, 2023Updated 2 years ago
- An integration for dbt and fzf that allows interactive selection and search of dbt models.☆74Jul 26, 2023Updated 2 years ago
- From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata.☆54Mar 6, 2026Updated last month
- A simple Rust library to retrieve data from https://api.carbonintensity.org.uk/☆11Apr 25, 2026Updated last week