Renumics / sliceguard
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
☆64Updated last year
Alternatives and similar repositories for sliceguard:
Users that are interested in sliceguard are comparing it to the libraries listed below
- Easy-to-use self-supervised representation learning for industrial AI☆26Updated 2 years ago
- Vectory provides a collection of tools to track and compare embedding versions.☆71Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆79Updated 2 months ago
- ☆118Updated 4 months ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆58Updated last year
- Generalist and Lightweight Model for Text Classification☆90Updated this week
- NLP with Rust for Python 🦀🐍☆61Updated 9 months ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated 11 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆49Updated last month
- SaLSa Optimizer implementation (No learning rates needed)☆28Updated last month
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆13Updated 2 months ago
- Pre-train Static Word Embeddings☆48Updated this week
- Drift detection module for machine learning pipelines.☆21Updated last year
- ☆275Updated 8 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 9 months ago
- ☆47Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆56Updated last week
- Effective frame sampling for ML applications.☆18Updated 2 months ago
- Client interface to Cleanlab Studio and the Trustworthy Language Model☆28Updated 3 weeks ago
- A gzip-based text-classification system.☆32Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- ☆58Updated 11 months ago
- Pipeline components that support partial_fit.☆45Updated 7 months ago
- 🦄 An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitter☆44Updated last year