IBM / unitxtLinks

🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data for end-to-end AI benchmarking

☆206

Alternatives and similar repositories for unitxt

Users that are interested in unitxt are comparing it to the libraries listed below

Sorting:

huggingface / data-is-better-together
Let's build better datasets, together!
☆260Updated 7 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
QuixiAI / spectrum
☆129Updated 4 months ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆291Updated last month
writer / writing-in-the-margins
☆118Updated 11 months ago
MoritzLaurer / synthetic-data-blog
This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data
☆68Updated last year
patronus-ai / Lynx-hallucination-detection
☆41Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65Updated last year
zetaalphavector / RAGElo
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
☆114Updated 3 weeks ago
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆107Updated last month
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
IBM / ensemble-instruct
codebase release for EMNLP2023 paper publication
☆19Updated 3 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 10 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77Updated 9 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆218Updated this week
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆284Updated 5 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆268Updated 10 months ago
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆185Updated 3 weeks ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆115Updated 10 months ago
jakespringer / echo-embeddings
☆152Updated last year
ibm-granite / granite-3.0-language-models
☆261Updated last month
microsoft / llm-data-creation
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
☆135Updated last year
microsoft / FILM
Official repo for "Make Your LLM Fully Utilize the Context"
☆253Updated last year
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆267Updated 3 months ago
AnswerDotAI / fastdata
☆154Updated 8 months ago
Data-Provenance-Initiative / Data-Provenance-Collection
☆244Updated 4 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago