IBM / unitxt
🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking
☆191 Updated this week
Alternatives and similar repositories for unitxt:
Users who are interested in unitxt are comparing it to the libraries listed below:
- Codebase release for EMNLP 2023 paper publication ☆19 Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆107 Updated 7 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs ☆52 Updated this week
- A framework for few-shot evaluation of autoregressive language models. ☆13 Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆79 Updated last month
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 Updated 4 months ago
- A package dedicated to running benchmark agreement testing ☆16 Updated this week
- WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. ☆41 Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite ☆90 Updated last year
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆41 Updated this week
- Let's build better datasets, together! ☆259 Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆255 Updated 9 months ago
- ☆45 Updated 3 months ago
- This is the official repository for Inheritune. ☆111 Updated 2 months ago
- ☆117 Updated last month
- ☆36 Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆61 Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆128 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆77 Updated 7 months ago
- ☆57 Updated 7 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆199 Updated this week
- Python library for Synthetic Data Generation ☆42 Updated this week
- ☆254 Updated 5 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆76 Updated 6 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆180 Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆215 Updated 6 months ago
- AI Evaluation Platform ☆46 Updated last month
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆108 Updated 3 weeks ago