instructor-ai / evalsLinks
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆52Updated last year
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- ☆83Updated 11 months ago
- Convert a web page to markdown☆79Updated last year
- Verbosity control for AI agents☆65Updated last year
- ☆77Updated last year
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆315Updated last month
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 8 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆336Updated last month
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- Claudette is Claude's friend☆279Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆87Updated 10 months ago
- ☆195Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated last year
- ☆188Updated 3 weeks ago
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆100Updated last year
- ☆36Updated 5 months ago
- A framework for optimizing DSPy programs with RL☆202Updated last week
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆195Updated last year
- ☆63Updated 2 months ago
- A framework for evaluating function calls made by LLMs☆38Updated last year
- ☆169Updated 3 weeks ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 7 months ago
- Annoucing Instructor Cloud☆37Updated last year
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆104Updated last month
- Tools to make language models a bit easier to use☆54Updated 3 weeks ago
- ☆159Updated 10 months ago
- ☆52Updated 6 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆132Updated last year