EthanRosenthal / medium-data-bakeoffLinks
A python library bakeoff for medium sized datasets
☆24Updated last year
Alternatives and similar repositories for medium-data-bakeoff
Users that are interested in medium-data-bakeoff are comparing it to the libraries listed below
Sorting:
- An experimental Athena extension for DuckDB 🐤☆54Updated 5 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- A serverless duckDB deployment at GCP☆39Updated 2 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Linear regression in SQL using dbt☆70Updated 5 months ago
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated 2 years ago
- ☆90Updated last year
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆18Updated 11 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆82Updated 4 months ago
- ☆22Updated 10 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆109Updated 6 months ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- ☆33Updated last year
- ☆26Updated 2 years ago
- Dask integration for Snowflake☆30Updated 7 months ago
- A repository of runnable examples using ibis☆44Updated 11 months ago
- Ibis analytics, with Ibis (and more!)☆22Updated 9 months ago
- Assessing whether data from database complies with reference information.☆43Updated this week
- A software engineering framework to jump start your machine learning projects☆37Updated last year
- ERPL is a DuckDB extension to integrate Enterprise Data in your Data Science and ML pipelines within minutes! ERPL connects DuckDB to SAP…☆42Updated last week
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆20Updated last year
- Write your dbt models using Ibis☆67Updated 3 months ago
- A curated list of awesome SQLMesh resources☆34Updated last month
- Arrow, pydantic style☆83Updated 2 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Heads Down Application for Mac OSX☆20Updated 4 years ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆33Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆47Updated last year
- rust-for-data☆45Updated last year