IBM / unitxtLinks
π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking
β199Updated this week
Alternatives and similar repositories for unitxt
Users that are interested in unitxt are comparing it to the libraries listed below
Sorting:
- codebase release for EMNLP2023 paper publicationβ19Updated last month
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β47Updated this week
- LM engine is a library for pretraining/finetuning LLMsβ57Updated this week
- A package dedicated for running benchmark agreement testingβ16Updated last month
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other laβ¦β27Updated this week
- Code accompanying "How I learned to start worrying about prompt formatting".β105Updated 2 weeks ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ130Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ112Updated 2 weeks ago
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- Python library for Synthetic Data Generationβ42Updated this week
- Synthetic Data Generation for Foundation Modelsβ21Updated 4 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.β59Updated 10 months ago
- Codebase accompanying the Summary of a Haystack paper.β78Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ64Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses.β88Updated 3 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β78Updated 2 weeks ago
- β259Updated 6 months ago
- The repository contains generative AI analytics platform application code.β26Updated last month
- A framework for few-shot evaluation of autoregressive language models.β13Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ104Updated 6 months ago
- β39Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 8 months ago
- Let's build better datasets, together!β259Updated 6 months ago
- Functional Benchmarks and the Reasoning Gapβ87Updated 8 months ago
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ207Updated last month
- Tools for managing datasets for governance and training.β85Updated last month
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ184Updated 5 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β108Updated last year
- Pre-train Static Word Embeddingsβ79Updated 3 weeks ago