kvh / dcp
Universal data copy
☆9 · Updated 2 years ago
Alternatives and similar repositories for dcp:
Users who are interested in dcp are comparing it to the libraries listed below.
- Data pipelines from re-usable components ☆108 · Updated last year
- Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations ☆47 · Updated 2 years ago
- the open-source product analytics tool for the modern data stack ☆28 · Updated 2 years ago
- ☆19 · Updated 4 years ago
- ☆30 · Updated 3 years ago
- A python library bakeoff for medium sized datasets ☆24 · Updated last year
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags) ☆14 · Updated 5 months ago
- BoilingData JS client (NodeJS and Browsers) ☆19 · Updated 6 months ago
- Linear regression in SQL using dbt ☆69 · Updated 2 months ago
- Search for similar short strings ☆52 · Updated 4 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 · Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Updated 2 years ago
- A utility for labeling clusters of text data. ☆28 · Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆38 · Updated 2 years ago
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆73 · Updated last month
- The stupidest database of all time. ☆55 · Updated this week
- Set-oriented Operations in Pandas ☆24 · Updated 4 years ago
- Arrow, pydantic style ☆82 · Updated 2 years ago
- A Python package that parses sql and converts it to ibis expressions ☆54 · Updated last year
- A Higher-Level, Composable SQL ☆41 · Updated this week
- ☆21 · Updated 7 months ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au… ☆42 · Updated 4 years ago
- SQL transformation tool for DuckDB written in Rust ☆43 · Updated last week
- EmbeDB is a small Python wrapper around LMDB built as key-value storage for embeddings. ☆13 · Updated 2 years ago
- Blazing fast, composable, Pythonic quantile filters. ☆136 · Updated last year
- A [personal]<-[notebook]->[network]. Complete with custom numerics for constrained Gaussian gravitation physics. ☆22 · Updated 2 years ago
- ☆22 · Updated 2 weeks ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆29 · Updated 3 months ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆38 · Updated 2 years ago