Renumics / sliceguard
A library for detecting problematic data segments in structured and unstructured data with a few lines of code.
☆64 · Updated last year
Alternatives and similar repositories for sliceguard
Users interested in sliceguard are comparing it to the libraries listed below.
- Easy-to-use self-supervised representation learning for industrial AI ☆26 · Updated 2 years ago
- Vectory provides a collection of tools to track and compare embedding versions. ☆71 · Updated 2 years ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow ☆63 · Updated last year
- BIG: Back In the Game of Creative AI ☆27 · Updated 2 years ago
- Use sync mode Playwright interactively, inside a Jupyter notebook ☆15 · Updated 3 months ago
- 📚 Datasets and models for instruction-tuning ☆238 · Updated last year
- NLP with Rust for Python 🦀🐍 ☆63 · Updated 2 months ago
- Completion After Prompt Probability. Make your LLM make a choice ☆79 · Updated 8 months ago
- Pre-train Static Word Embeddings ☆84 · Updated last month
- Using short models to classify long texts ☆21 · Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆66 · Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 · Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 · Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 · Updated last year
- Semantic search engine indexing 110 million academic publications ☆90 · Updated last week
- LLM finetuned for generating symbolic music ☆42 · Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 · Updated 8 months ago
- 🤝 Trade any tensors over the network ☆30 · Updated last year
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects ☆23 · Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 · Updated 10 months ago
- ☆12 · Updated last year
- ☆41 · Updated 2 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆83 · Updated 6 months ago
- Efficient few-shot learning with cross-encoders. ☆54 · Updated last year
- ☆124 · Updated 8 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module ☆51 · Updated last week
- Train an adapter for any embedding model in under a minute ☆106 · Updated 3 months ago
- The Python Component System (PCS) is an API and CLI for building, running, and sharing Python code. AgentOS is a set of libraries built o… ☆19 · Updated 2 years ago
- Production-ready data processing made easy and shareable ☆354 · Updated last year