pentoai / vectory
Vectory provides a collection of tools to track and compare embedding versions.
β70Updated last year
Related projects β
Alternatives and complementary repositories for vectory
- Efficient BM25 with DuckDB π¦β29Updated last month
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.β63Updated 10 months ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated last year
- KEN: Relational Data Embeddingsβ27Updated 10 months ago
- NLP with Rust for Python π¦πβ59Updated 5 months ago
- Check if you have training samples in your test setβ64Updated 2 years ago
- Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scoresβ100Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021)β153Updated last year
- Functional deep learningβ106Updated last year
- Vectorizers for a range of different data typesβ97Updated this week
- The long missing library for python confidence intervalsβ132Updated 5 months ago
- Tree-based indexes for neural-searchβ28Updated 8 months ago
- A python package for benchmarking interpretability techniques on Transformers.β212Updated last month
- Generalist and Lightweight Model for Text Classificationβ49Updated last week
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to productionβ¦β29Updated 11 months ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpoβ¦β103Updated 8 months ago
- An active learning library for Pytorch based on Lightning-Fabric.β79Updated 6 months ago
- Distributed skorch on Ray Trainβ57Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly.β103Updated 4 months ago
- just a bunch of useful embeddingsβ466Updated 2 months ago
- Super Simple Similarities Serviceβ142Updated last year
- automatic data slicingβ35Updated 3 years ago
- Late Interaction Models Training & Retrievalβ165Updated this week
- β30Updated 2 years ago
- A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient wayβ282Updated this week
- Run compute jobs on AWS as if you were running them locally.β125Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)β151Updated last year
- Cyclemoid implementation for PyTorchβ87Updated 2 years ago
- Drift detection module for machine learning pipelines.β21Updated last year