ntropy-network / embedb
EmbeDB is a small Python wrapper around LMDB built as key-value storage for embeddings.
☆13Updated 2 years ago
Alternatives and similar repositories for embedb:
Users that are interested in embedb are comparing it to the libraries listed below
- ☆30Updated 3 years ago
- ☆42Updated last year
- Python SDK for ntropy☆18Updated last month
- Metadata store for Production ML☆89Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last week
- A Toolbox for the Evaluation of machine learning Explanations☆15Updated last year
- ML tools that we use internally and which you may find useful too.☆24Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- Python package for deduplication/entity resolution using active learning☆78Updated 4 months ago
- SPEAR: Programmatically label and build training data quickly.☆103Updated 6 months ago
- Pipeline components that support partial_fit.☆44Updated 6 months ago
- ☆30Updated 2 years ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆78Updated 2 years ago
- ☆19Updated 4 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆14Updated 3 months ago
- Drift detection module for machine learning pipelines.☆21Updated last year
- Gzip and nearest neighbors for text classification☆56Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- ☆28Updated 2 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.☆77Updated 6 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆22Updated last year
- spock is a framework that helps manage complex parameter configurations during research and development of Python applications☆129Updated last year
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated last year
- [AAAI 2021] TextWiser: Text Featurization Library☆53Updated last month
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆60Updated last year
- ☆75Updated last year
- Inference engine for GLiNER models, in Rust☆31Updated this week