ntropy-network / embedb
EmbeDB is a small Python wrapper around LMDB built as key-value storage for embeddings.
☆13Updated 2 years ago
Alternatives and similar repositories for embedb:
Users that are interested in embedb are comparing it to the libraries listed below
- Python SDK for ntropy☆18Updated this week
- SPEAR: Programmatically label and build training data quickly.☆103Updated 5 months ago
- An efficient, to-the-point, and easy-to-use checklist to following when deploying an ML model into production.☆31Updated last year
- ☆30Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆78Updated 3 months ago
- Repository for my master thesis on automated string handling☆16Updated 3 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆46Updated last year
- Efficient BM25 with DuckDB 🦆☆31Updated last month
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆17Updated last year
- Drift detection module for machine learning pipelines.☆21Updated last year
- ML tools that we use internally and which you may find useful too.☆24Updated 2 years ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- Pipeline components that support partial_fit.☆43Updated 4 months ago
- HiPlot fetcher for experiments logged with MLflow☆13Updated 2 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 2 years ago
- A Toolbox for the Evaluation of machine learning Explanations☆15Updated 10 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆21Updated 11 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 7 months ago
- Batch shap calculations.☆31Updated last year
- Hassle-free ML Pipelines on Kubernetes☆38Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 2 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.☆77Updated 4 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Prototyping a question and answer bot over PDFs☆38Updated last year
- Retrieval Augmented Generation applications☆27Updated last year
- ☆42Updated last year
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year