instructor-ai / evalsLinks

Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.

☆52

Alternatives and similar repositories for evals

Users that are interested in evals are comparing it to the libraries listed below

Sorting:

eugeneyan / align-app
☆75Updated 8 months ago
AnswerDotAI / web2md-ext
Get a markdown version of any webpage with a keyboard shortcut.
☆65Updated 5 months ago
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
jxnl / n-levels-of-rag
☆195Updated last year
567-labs / systematically-improving-rag
☆156Updated last week
MaximeRivest / attachments
Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…
☆229Updated 2 weeks ago
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 9 months ago
hamelsmu / claudesave
A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.
☆86Updated 8 months ago
567-labs / kura
Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…
☆259Updated last month
instructor-ai / cloud
Annoucing Instructor Cloud
☆36Updated 11 months ago
CyrusNuevoDia / llegos
A strongly typed Python DSL for developing message passing multi agent systems
☆53Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101Updated last year
AnswerDotAI / web2md
Convert a web page to markdown
☆73Updated 11 months ago
567-labs / fastllm
A collection of LLM services you can self host via docker or modal labs to support your applications development
☆192Updated last year
eugeneyan / visualizing-finetunes
☆78Updated last year
jxnl / blog
☆149Updated 3 weeks ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
ai-evals-course / isaac-fasthtml-workshop
☆57Updated last month
AnswerDotAI / GeminiSave
☆52Updated 3 months ago
yoheinakajima / jsondr
converts url content into JSON with a simple prefix
☆70Updated last year
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆76Updated last year
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆102Updated last year
jlewi / foyle
Foyle is a copilot to help developers deploy and operate their applications.
☆131Updated 4 months ago
braintrustdata / braintrust-cookbook
☆36Updated 2 weeks ago
AnswerDotAI / toolslm
Tools to make language models a bit easier to use
☆48Updated 2 weeks ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆181Updated 10 months ago
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
AnswerDotAI / claudette
Claudette is Claude's friend
☆251Updated last week
seanchatmangpt / dspygen
A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.
☆126Updated 9 months ago
SpellcraftAI / oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
☆99Updated last year