kvh / dcpLinks
Universal data copy
☆9Updated 2 years ago
Alternatives and similar repositories for dcp
Users that are interested in dcp are comparing it to the libraries listed below
Sorting:
- Data pipelines from re-usable components☆108Updated 2 years ago
- ☆30Updated 3 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆46Updated 2 years ago
- ☆19Updated 4 years ago
- Time series forecasting with DuckDB and Evidence☆39Updated 7 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 2 years ago
- Arrow, pydantic style☆82Updated 2 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated last month
- Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with function…☆90Updated 3 years ago
- Prototyping a question and answer bot over PDFs☆39Updated last year
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t…☆25Updated 3 years ago
- Type System for Data Analysis in Python☆212Updated 4 months ago
- BoilingData JS client (NodeJS and Browsers)☆19Updated 8 months ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- ☆22Updated 9 months ago
- Tiny inference-only implementation of LLaMA☆93Updated last year
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 2 years ago
- GraphQL service for arrow tables and parquet data sets.☆89Updated 4 months ago
- Python binding for DataFusion☆59Updated 2 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆62Updated 8 months ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆43Updated 4 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆14Updated 8 months ago
- python library for automated dataset normalization☆115Updated last year
- Demos of Materialize, the operational data warehouse.☆51Updated 2 months ago
- Conversion of Jupyter and Zeppelin notebooks to Jupyter or Markdown formats☆27Updated 4 years ago
- Blazing fast, composable, Pythonic quantile filters.☆136Updated 2 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 5 months ago