kvh / dcp
Universal data copy
☆9 Updated 2 years ago
Alternatives and similar repositories for dcp:
Users who are interested in dcp are comparing it to the libraries listed below.
- Data pipelines from re-usable components ☆108 Updated last year
- ☆19 Updated 4 years ago
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago
- Arrow, pydantic style ☆84 Updated 2 years ago
- ☆30 Updated 3 years ago
- Codd method-chained SQL generator and Pandas data processing in Python. ☆117 Updated last year
- A Python library bakeoff for medium-sized datasets ☆24 Updated last year
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations ☆47 Updated last year
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 Updated last year
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support ☆61 Updated 4 months ago
- ☆21 Updated 6 months ago
- ☆22 Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 Updated 2 years ago
- Efficient BM25 with DuckDB 🦆 ☆39 Updated 2 months ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypotheses, and visualizing results in a web browser ☆33 Updated last year
- Graph Engine for Exploration and Search ☆40 Updated last year
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆26 Updated 2 months ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- ☆136 Updated last week
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆28 Updated 2 years ago
- Demos of Materialize, the operational data warehouse. ☆51 Updated 5 months ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Search for similar short strings ☆52 Updated 4 years ago
- Prototyping a question and answer bot over PDFs ☆38 Updated last year
- Chatbot for BI ☆38 Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 Updated 3 years ago
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t… ☆24 Updated 2 years ago
- HNSW implemented in Python ☆19 Updated 5 years ago
- Tiny inference-only implementation of LLaMA ☆92 Updated 10 months ago