Renumics / sliceguard
A library for detecting problematic data segments in structured and unstructured data with a few lines of code.
☆62 Updated 10 months ago
Related projects
Alternatives and complementary repositories for sliceguard
- Easy-to-use self-supervised representation learning for industrial AI ☆25 Updated last year
- Vectory provides a collection of tools to track and compare embedding versions. ☆70 Updated last year
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow ☆54 Updated 11 months ago
- SaLSa Optimizer implementation (No learning rates needed) ☆28 Updated last month
- a unified framework for leveraging LLMs ☆55 Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆42 Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆99 Updated 9 months ago
- Actually Robust Training - Tool Inspired by Andrej Karpathy "Recipe for training neural networks". It allows you to decompose your Deep… ☆44 Updated 6 months ago
- Chunk your text using gpt4o-mini more accurately ☆39 Updated 3 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆33 Updated 8 months ago
- Production-grade embedding generation, for any length of text, for transformer models. ☆24 Updated this week
- Generalist and Lightweight Model for Text Classification ☆48 Updated 2 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆46 Updated this week
- Completion After Prompt Probability. Make your LLM make a choice ☆69 Updated last week
- Fine-tune Mistral 7B to generate fashion style suggestions ☆31 Updated 10 months ago
- Library for creating causal chains using language models. ☆76 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆44 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- Web App for generating synthetic data ☆45 Updated 2 months ago
- [WIP] A 🔥 interface for running code in the cloud ☆86 Updated last year
- 📚 Datasets and models for instruction-tuning ☆231 Updated last year
- Client interface for all things Cleanlab Studio ☆27 Updated this week
- Tools to make language models a bit easier to use ☆30 Updated 2 weeks ago