π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking
β211Feb 16, 2026Updated last month
Alternatives and similar repositories for unitxt
Users that are interested in unitxt are comparing it to the libraries listed below
Sorting:
- A package dedicated for running benchmark agreement testingβ17Sep 18, 2025Updated 6 months ago
- β14Dec 1, 2025Updated 3 months ago
- orchestrate ML model deployments, monitoring and governance for Cloud Pak for Data and other Data & AI platforms faster.β12Dec 22, 2025Updated 2 months ago
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024β30Dec 19, 2024Updated last year
- codebase release for EMNLP2023 paper publicationβ19Sep 18, 2025Updated 6 months ago
- The Agent Lifecycle Toolkit (ALTK) is a library of components to help agent builders improve their agent with minimal integration effort β¦β111Updated this week
- Top papers related to LLM-based agent evaluationβ89Oct 21, 2025Updated 4 months ago
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ214Sep 18, 2025Updated 6 months ago
- An official implementation of ProbeGenβ13Oct 20, 2024Updated last year
- An open source benchmarking framework for IT automationβ310Updated this week
- β13Jul 13, 2025Updated 8 months ago
- Interacting with bee-api through OpenAI Python SDKβ25Mar 18, 2025Updated last year
- Latent Large Language Modelsβ19Aug 24, 2024Updated last year
- Mellea is a library for writing generative programs.β345Updated this week
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).β63Jul 1, 2025Updated 8 months ago
- A Lossless Compression Library for AI pipelinesβ310Jul 3, 2025Updated 8 months ago
- Implementation of sequential Information Bottleneck (sIB) in Python and in C++β20Updated this week
- The prime repository for state-of-the-art Multilingual Question Answering research and development.β739Sep 18, 2025Updated 6 months ago
- The predecessor of CiteLab.β18Feb 3, 2026Updated last month
- In-Context Explainability 360 toolkitβ65Mar 9, 2026Updated last week
- An official PyTorch implementation for CLIPPRβ30Jul 22, 2023Updated 2 years ago
- API backend for Beeβ35Mar 18, 2025Updated last year
- Protobuf interface and stubs for Bee Agent Framework.β10Mar 18, 2025Updated last year
- Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survβ¦β30Apr 29, 2025Updated 10 months ago
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy dataβ¦β1,410Feb 16, 2026Updated last month
- β20May 30, 2024Updated last year
- Additional tools for the Bee Agent Frameworkβ14Mar 18, 2025Updated last year
- Python library for Synthetic Data Generationβ52Feb 16, 2026Updated last month
- An evaluation suite for Retrieval-Augmented Generation (RAG).β23Apr 26, 2025Updated 10 months ago
- An implementation of GrASP (Shnarch et. al., 2017)β23Aug 29, 2022Updated 3 years ago
- β13Dec 15, 2025Updated 3 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"β18Mar 15, 2024Updated 2 years ago
- Synthetic Data Generation for Foundation Modelsβ21Nov 10, 2025Updated 4 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)β15Aug 6, 2025Updated 7 months ago
- β29Jul 9, 2024Updated last year
- Open source project for data preparation for GenAI applicationsβ911Mar 13, 2026Updated last week
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolutionβ12Jun 15, 2023Updated 2 years ago
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, β¦β692Mar 12, 2026Updated last week
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.β34Jun 30, 2024Updated last year