π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking
β211Apr 21, 2026Updated last week
Alternatives and similar repositories for unitxt
Users that are interested in unitxt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A package dedicated for running benchmark agreement testingβ18Sep 18, 2025Updated 7 months ago
- orchestrate ML model deployments, monitoring and governance for Cloud Pak for Data and other Data & AI platforms faster.β12Dec 22, 2025Updated 4 months ago
- codebase release for EMNLP2023 paper publicationβ19Sep 18, 2025Updated 7 months ago
- A library of components to help agent builders boost their agent performance (tool-calling, instruction following, policy, etc.)β114Mar 16, 2026Updated last month
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ216Sep 18, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Open source no-code system for text annotation and building of text classifiersβ272May 26, 2025Updated 11 months ago
- An open source benchmarking framework for IT automationβ308Updated this week
- Quality Controlled Paraphrase Generation (ACL 2022)β71Sep 17, 2025Updated 7 months ago
- Run the entire bee application stack using docker-composeβ155Mar 18, 2025Updated last year
- Latent Large Language Modelsβ19Aug 24, 2024Updated last year
- Mellea is a library for writing generative programs.β397Updated this week
- Examples and tutorials for building AI applications with watsonx.ai Flows Engineβ118Sep 18, 2025Updated 7 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.β739Sep 18, 2025Updated 7 months ago
- The predecessor of CiteLab.β18Feb 3, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Go client library for IBM Cloud Platform Servicesβ20Updated this week
- In-Context Explainability 360 toolkitβ66Mar 9, 2026Updated last month
- API backend for Beeβ36Mar 18, 2025Updated last year
- Updating collection of summarization datasets in 100+ languages, based on our paper "The State and Fate of Summarization Datasets: A Survβ¦β31Apr 29, 2025Updated last year
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy dataβ¦β1,415Mar 30, 2026Updated 3 weeks ago
- β20May 30, 2024Updated last year
- Additional tools for the Bee Agent Frameworkβ14Mar 18, 2025Updated last year
- Python library for Synthetic Data Generationβ52Mar 31, 2026Updated last month
- β13Dec 15, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"β18Mar 15, 2024Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)β15Apr 8, 2026Updated 3 weeks ago
- β29Jul 9, 2024Updated last year
- Open source project for data preparation for GenAI applicationsβ928Mar 13, 2026Updated last month
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolutionβ12Jun 15, 2023Updated 2 years ago
- CUGA is an open-source generalist agent harness for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrβ¦β707Updated this week
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limitβ63Jun 21, 2023Updated 2 years ago
- β15Oct 4, 2024Updated last year
- Web application (UI) for Bee Stack.β76Mar 18, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β11Sep 19, 2024Updated last year
- Experiment of using Tangent to autodiff tritonβ82Jan 22, 2024Updated 2 years ago
- A different, but useful, textcat approach.β18Jul 15, 2024Updated last year
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ104Jan 15, 2024Updated 2 years ago
- A toolkit for scaling law research ββ62Jan 27, 2025Updated last year
- Research framework for low resource text classification that allows the user to experiment with classification models and active learningβ¦β101Mar 9, 2022Updated 4 years ago
- Repository for Sparse Universal Transformersβ20Oct 23, 2023Updated 2 years ago