Renumics / sliceguardLinks
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
☆64Updated last year
Alternatives and similar repositories for sliceguard
Users that are interested in sliceguard are comparing it to the libraries listed below
Sorting:
- Vectory provides a collection of tools to track and compare embedding versions.☆71Updated 2 years ago
- Easy-to-use self-supervised representation learning for industrial AI☆26Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆80Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆505Updated 3 months ago
- Production-ready data processing made easy and shareable☆353Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆64Updated last year
- ☆124Updated last year
- Pre-train Static Word Embeddings☆89Updated 2 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 10 months ago
- 📚 Datasets and models for instruction-tuning☆237Updated 2 years ago
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …☆21Updated 6 months ago
- 🦄 An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitter☆45Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated last year
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- ☆358Updated last year
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 4 months ago
- Web App for generating synthetic data☆48Updated last year
- ☆20Updated 2 years ago
- Library for creating causal chains using language models.☆81Updated 2 years ago
- Semantic search engine indexing 110 million academic publications☆91Updated last week
- Generalist and Lightweight Model for Text Classification☆164Updated 4 months ago
- Classify data instantly using an LLM☆274Updated last year
- ☆21Updated 2 years ago