Renumics / sliceguardLinks
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
☆64Updated 2 years ago
Alternatives and similar repositories for sliceguard
Users that are interested in sliceguard are comparing it to the libraries listed below
Sorting:
- Easy-to-use self-supervised representation learning for industrial AI☆26Updated 2 years ago
- Vectory provides a collection of tools to track and compare embedding versions.☆71Updated 3 years ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆64Updated 2 years ago
- ☆125Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- Pre-train Static Word Embeddings☆94Updated 4 months ago
- AI Data Management & Evaluation Platform☆215Updated 2 years ago
- Production-ready data processing made easy and shareable☆358Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- Generalist and Lightweight Model for Text Classification☆166Updated last month
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- Reads arXiv papers using Text-to-Speech☆63Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated last month
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆511Updated 5 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Datamodels for hugging face tokenizers☆86Updated last week
- ☆358Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook☆17Updated 9 months ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆242Updated last week
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year