kvh / dcp
Universal data copy
☆9Updated 2 years ago
Alternatives and similar repositories for dcp:
Users that are interested in dcp are comparing it to the libraries listed below
- Data pipelines from re-usable components☆108Updated last year
- ☆19Updated 4 years ago
- the open-source product analytics tool for the modern data stack☆28Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Prototyping a question and answer bot over PDFs☆38Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions☆21Updated last year
- ☆30Updated 3 years ago
- BoilingData JS client (NodeJS and Browsers)☆19Updated 3 months ago
- Efficient BM25 with DuckDB 🦆☆36Updated last month
- Codd method-chained SQL generator and Pandas data processing in Python.☆117Updated last year
- Tantivy directory implementation backed by object_store☆29Updated 11 months ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated last year
- Convert JSON files to Apache Parquet.☆46Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated last year
- Arrow, pydantic style☆84Updated 2 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- Linear regression in SQL using dbt☆69Updated last week
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆14Updated 3 months ago
- The Data-centric IDE for Data Science and AI☆36Updated 2 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆13Updated 6 months ago
- A serverless duckDB deployment at GCP☆38Updated 2 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆76Updated 2 years ago
- A Higher-Level, Composable SQL☆38Updated this week
- ☆27Updated 7 months ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- Tiny inference-only implementation of LLaMA☆91Updated 9 months ago
- dbt-prql allows writing PRQL in dbt models☆101Updated last week