kvh / dcpLinks

Universal data copy

☆9

Alternatives and similar repositories for dcp

Users that are interested in dcp are comparing it to the libraries listed below

Sorting:

patterns-app / patterns-devkit
Data pipelines from re-usable components
☆108Updated 2 years ago
wagtaillabs / GRANT
☆19Updated 4 years ago
gjreda / scratch-pdf-bot
Prototyping a question and answer bot over PDFs
☆39Updated last year
maxdotio / mighty-batch
Highly concurrent and fast content processing for Mighty Inference Server
☆10Updated 2 years ago
cleanzr / dblink
Distributed Bayesian Entity Resolution in Apache Spark
☆57Updated 4 years ago
MaxHalford / taxi-demo-rp-mz-rv-rd-st
🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations
☆45Updated 2 years ago
jorgecarleitao / arrowdantic
Arrow, pydantic style
☆83Updated 2 years ago
google / nitroml
NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…
☆43Updated 4 years ago
Florents-Tselai / pandas-sets
Set-oriented Operations in Pandas
☆24Updated 5 years ago
fal-ai / dbt_feature_store
Build your feature store with macros right within your dbt repository
☆38Updated 2 years ago
pmbaumgartner / clabel
A utility for labeling clusters of text data.
☆28Updated 3 years ago
Ben-Epstein / spacy-to-hf
A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)
☆15Updated 8 months ago
HazyResearch / epoxy
Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings
☆77Updated 2 years ago
MuttData / muttlib
☆29Updated last year
dylan-profiler / compressio
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…
☆29Updated 2 years ago
aivanzhang / panda_patrol
☆22Updated 10 months ago
jodietheai / NER-10K
NER model for 10K and 10Q SEC filings
☆14Updated 5 years ago
jason-jz-zhu / databathing
☆22Updated 3 months ago
pmbaumgartner / spacy-setfit-textcat
☆30Updated 3 years ago
MonetDBSolutions / MonetDBe-Python
Embedded MonetDB with a Python frontend and fast Numpy/Pandas support
☆62Updated 8 months ago
hotgluexyz / gluestick
A small Python module containing quick utility functions for standard ETL processes.
☆35Updated this week
romanorac / pandas-analytics-server
Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser
☆33Updated 2 years ago
dncc / qpick
Search for similar short strings
☆52Updated 4 years ago
astronomer / ray-airflow-demo
☆30Updated 3 years ago
wesmadrigal / GraphReduce
Abstractions for feature engineering on large graphs of tabular data.
☆21Updated 3 weeks ago
rescrv / napkin
Back-of-the-envelope stuffs in Python
☆20Updated last year
hypergol / hypergol
Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…
☆53Updated 2 years ago
elehcimd / nb2md
Conversion of Jupyter and Zeppelin notebooks to Jupyter or Markdown formats
☆27Updated 4 years ago
seatgeek / druzhba
☆36Updated 5 months ago
datafusion-contrib / datafusion-python
Python binding for DataFusion
☆59Updated 2 years ago