A Rust based data/CSV/Parquet file generator
β66Mar 3, 2025Updated last year
Alternatives and similar repositories for datahobbit
Users that are interested in datahobbit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Various data stream/batch process demo with Apache Scala Spark πβ12Feb 28, 2020Updated 6 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.β27Mar 25, 2024Updated 2 years ago
- Proof-of-concept extension combining the delta extension with Unity Catalogβ103Updated this week
- β84Updated this week
- dbt starter code for enterprise Snowflake usage data artifactsβ21Sep 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A reusable UI element for editing lists of key/value data.β15Feb 5, 2017Updated 9 years ago
- TPC-H benchmark data generation in pure Rustβ241Updated this week
- β20Nov 22, 2024Updated last year
- Demo of DuckDB Spark API implements. Same Pyspark code, but DuckDB under the hoodβ15Nov 16, 2023Updated 2 years ago
- The Data Product Specificationβ11Jan 28, 2025Updated last year
- csv and flat-file sniffer built in Rust.β45Jan 26, 2024Updated 2 years ago
- Using DuckDB with AWS Lambda to process Delta Lake dataβ34Jan 26, 2025Updated last year
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.β15Feb 15, 2025Updated last year
- Build your own S3-Select in 400 lines of Rust!β14Mar 23, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Lambda function to serverlessly repartition parquet files in S3β40Mar 30, 2025Updated last year
- β27Apr 30, 2026Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.β51Sep 7, 2023Updated 2 years ago
- SQL Benchmark derived from TPC-DSβ15May 20, 2023Updated 3 years ago
- β12Oct 24, 2025Updated 7 months ago
- A rust implemention based on `How Query Engines Work`β15Sep 2, 2024Updated last year
- A platform to manage the data product life cycleβ22Mar 25, 2026Updated 2 months ago
- Model drift detectionβ11Jul 22, 2023Updated 2 years ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.β127Jan 21, 2025Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDBβ339May 30, 2026Updated last week
- Arrow Flight SQL Server for DuckDBβ148Mar 31, 2026Updated 2 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β29Dec 9, 2024Updated last year
- DataFusion FlightSQL Serverβ29Apr 13, 2026Updated last month
- DuckDB WebMacro: Share and Load your SQL Macros via gistsβ15Mar 24, 2026Updated 2 months ago
- Demo repository to lambda-fy your dbt runsβ11Sep 7, 2023Updated 2 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuckβ236May 15, 2026Updated 3 weeks ago
- DuckDB CronJob Extensionβ50Mar 29, 2026Updated 2 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructureβ204Oct 20, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Protobuf to Arrow, using Rustβ26May 25, 2026Updated 2 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed dailyβ113Updated this week
- An experimental Athena extension for DuckDB π€β57Dec 31, 2024Updated last year
- Python pipeline utility libraryβ18Jul 25, 2023Updated 2 years ago
- An integration for dbt and fzf that allows interactive selection and search of dbt models.β74Jul 26, 2023Updated 2 years ago
- From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata.β54Mar 6, 2026Updated 3 months ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras.β43Sep 30, 2025Updated 8 months ago