NickCrews / mismo
The SQL/Ibis powered sklearn of record linkage
β14Updated last week
Related projects β
Alternatives and complementary repositories for mismo
- Jupyter Cell / Line Magics for DuckDBβ38Updated last month
- An experimental Athena extension for DuckDB π€β49Updated 8 months ago
- List of entity resolution software and resources.β35Updated 8 months ago
- Linear regression in SQL using dbtβ65Updated last month
- Write your dbt models using Ibisβ52Updated 3 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed dailyβ81Updated this week
- β16Updated last year
- asyncio bridge to the duckdb libraryβ34Updated last year
- A Singer.io target for DuckDBβ17Updated last month
- Ibis analytics, with Ibis (and more!)β19Updated last month
- β139Updated this week
- Examples for the MotherDuck WASM Client library, enabling MotherDuck integration for WebAssembly-powered DuckDBβ35Updated last month
- A serverless duckDB deployment at GCPβ35Updated 2 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, Sβ¦β12Updated 3 months ago
- A DuckDB extension to read data directly from databases supporting the ODBC interfaceβ78Updated last year
- SQLMesh example projectsβ16Updated 5 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of Bβ¦β14Updated this week
- The Modern Data Stack in a Python packageβ49Updated 11 months ago
- A repository of runnable examples using ibisβ40Updated 4 months ago
- Read Apache Arrow batches from ODBC data sources in Pythonβ57Updated 2 weeks ago
- β82Updated 6 months ago
- DuckDB Power Query Custom Connector by MotherDuckβ45Updated last month
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those pluβ¦β55Updated last year
- Scripts to make specific datasets cleaner and more convenientβ40Updated last year
- An open-source library that leverages Pythonβs data science ecosystem to build powerful end-to-end Entity Resolution workflows.β71Updated this week
- Integrates DuckDB with Google BigQuery, allowing direct querying and management of BigQuery datasetsβ58Updated this week
- FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs)β65Updated this week
- Nicely modeled data built on the Github Archive.β56Updated 8 months ago
- Duckman - Manage your DuckDB CLI with easeβ14Updated last week
- β16Updated 9 months ago