A Rust-based data/CSV/Parquet file generator
☆66Mar 3, 2025Updated last year
Alternatives and similar repositories for datahobbit
Users who are interested in datahobbit are comparing it to the libraries listed below.
- Various data stream/batch process demo with Apache Scala Spark 🚀☆12Feb 28, 2020Updated 6 years ago
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow.☆27Mar 25, 2024Updated 2 years ago
- Proof-of-concept extension combining the delta extension with Unity Catalog☆104Updated this week
- ☆85Jun 5, 2026Updated 3 weeks ago
- dbt starter code for enterprise Snowflake usage data artifacts☆21Sep 7, 2022Updated 3 years ago
- A reusable UI element for editing lists of key/value data.☆15Feb 5, 2017Updated 9 years ago
- ☆20Nov 22, 2024Updated last year
- Demo of the DuckDB Spark API implementation. Same PySpark code, but DuckDB under the hood☆15Nov 16, 2023Updated 2 years ago
- The Data Product Specification☆11Jan 28, 2025Updated last year
- CSV and flat-file sniffer built in Rust.☆45Jan 26, 2024Updated 2 years ago
- SQLMesh example projects☆42Jul 2, 2025Updated 11 months ago
- Using DuckDB with AWS Lambda to process Delta Lake data☆34Jan 26, 2025Updated last year
- A CLI for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- SQL Benchmark derived from TPC-H☆11May 20, 2023Updated 3 years ago
- Lambda function to serverlessly repartition parquet files in S3☆40Mar 30, 2025Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- SQL Benchmark derived from TPC-DS☆15May 20, 2023Updated 3 years ago
- ☆12Oct 24, 2025Updated 8 months ago
- Live Training Session: Machine Learning with Scikit Learn☆15Jun 30, 2020Updated 6 years ago
- a pytest plugin for dbt adapter test suites☆19Oct 31, 2023Updated 2 years ago
- A platform to manage the data product life cycle☆22Mar 25, 2026Updated 3 months ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆127Jan 21, 2025Updated last year
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆341May 30, 2026Updated last month
- Arrow Flight SQL Server for DuckDB☆149Mar 31, 2026Updated 3 months ago
- DataFusion FlightSQL Server☆29Jun 22, 2026Updated last week
- DuckDB WebMacro: Share and Load your SQL Macros via gists☆15Mar 24, 2026Updated 3 months ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- DuckDB CronJob Extension☆50Mar 29, 2026Updated 3 months ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆205Oct 20, 2025Updated 8 months ago
- Protobuf to Arrow, using Rust☆26Jun 22, 2026Updated last week
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆114Updated this week
- An experimental Athena extension for DuckDB 🐤☆57Dec 31, 2024Updated last year
- An integration for dbt and fzf that allows interactive selection and search of dbt models.☆74Jul 26, 2023Updated 2 years ago
- From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata.☆54Mar 6, 2026Updated 3 months ago
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆43Sep 30, 2025Updated 9 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆279Sep 25, 2024Updated last year
- Apache Spark Connect Client for Rust☆116Jun 10, 2025Updated last year
- Function for automatically detecting Simpson's Paradox☆18Jan 17, 2021Updated 5 years ago
- A repo to track data engineering projects☆14Nov 11, 2022Updated 3 years ago