kvh / dcpLinks
Universal data copy
☆9Updated 2 years ago
Alternatives and similar repositories for dcp
Users that are interested in dcp are comparing it to the libraries listed below
Sorting:
- Data pipelines from re-usable components☆108Updated 2 years ago
- ☆19Updated 4 years ago
- Prototyping a question and answer bot over PDFs☆39Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆45Updated 2 years ago
- Arrow, pydantic style☆83Updated 2 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆43Updated 4 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆15Updated 8 months ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 2 years ago
- ☆29Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆29Updated 2 years ago
- ☆22Updated 10 months ago
- NER model for 10K and 10Q SEC filings☆14Updated 5 years ago
- ☆22Updated 3 months ago
- ☆30Updated 3 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆62Updated 8 months ago
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated this week
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated 2 years ago
- Search for similar short strings☆52Updated 4 years ago
- ☆30Updated 3 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- Back-of-the-envelope stuffs in Python☆20Updated last year
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Conversion of Jupyter and Zeppelin notebooks to Jupyter or Markdown formats☆27Updated 4 years ago
- ☆36Updated 5 months ago
- Python binding for DataFusion☆59Updated 2 years ago