🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking
☆211 · Feb 16, 2026 · Updated last week
Alternatives and similar repositories for unitxt
Users who are interested in unitxt are comparing it to the libraries listed below
- A package dedicated for running benchmark agreement testing ☆17 · Sep 18, 2025 · Updated 5 months ago
- ☆14 · Dec 1, 2025 · Updated 2 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆18 · Mar 15, 2024 · Updated last year
- ☆13 · Dec 15, 2025 · Updated 2 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- A toolkit for scaling law research ☆57 · Jan 27, 2025 · Updated last year
- ☆29 · Jul 9, 2024 · Updated last year
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024 ☆29 · Dec 19, 2024 · Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Jun 21, 2023 · Updated 2 years ago
- codebase release for EMNLP2023 paper publication ☆19 · Sep 18, 2025 · Updated 5 months ago
- The predecessor of CiteLab. ☆18 · Feb 3, 2026 · Updated 3 weeks ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆213 · Sep 18, 2025 · Updated 5 months ago
- Interact with ChatGPT and GPT-4 in alternative ways ☆13 · Mar 17, 2024 · Updated last year
- ☆82 · Apr 16, 2024 · Updated last year
- Official code for the paper "Attention as a Hypernetwork" ☆48 · Jun 22, 2024 · Updated last year
- train with kittens! ☆63 · Oct 25, 2024 · Updated last year
- Open source project for data preparation for GenAI applications ☆903 · Feb 16, 2026 · Updated last week
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆110 · Oct 11, 2025 · Updated 4 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆25 · Oct 14, 2024 · Updated last year
- ☆24 · Sep 25, 2024 · Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models ☆25 · Jul 22, 2024 · Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆24 · Jun 6, 2024 · Updated last year
- Interacting with bee-api through OpenAI Python SDK ☆25 · Mar 18, 2025 · Updated 11 months ago
- Label shift estimation for transfer difficulty with Familiarity. ☆10 · Feb 4, 2025 · Updated last year
- Improving transparency of large language models' reasoning ☆14 · Nov 25, 2025 · Updated 3 months ago
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State ☆11 · Sep 21, 2024 · Updated last year
- An official implementation of ProbeGen ☆13 · Oct 20, 2024 · Updated last year
- GPT4 based personalized ArXiv paper assistant bot ☆12 · Mar 1, 2024 · Updated last year
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- The official Python library for Formulaic ☆18 · Apr 25, 2024 · Updated last year
- ☆14 · Jan 6, 2025 · Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year
- ☆13 · Nov 27, 2025 · Updated 3 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023) ☆14 · Aug 6, 2025 · Updated 6 months ago
- Python library for Evaluation ☆16 · Feb 16, 2026 · Updated last week
- ☆12 · Jan 29, 2021 · Updated 5 years ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention ☆12 · May 24, 2023 · Updated 2 years ago