instructor-ai / evalsLinks
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆52Updated 10 months ago
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- Get a markdown version of any webpage with a keyboard shortcut.☆65Updated 6 months ago
- ☆79Updated 9 months ago
- Verbosity control for AI agents☆65Updated last year
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆254Updated last month
- Convert a web page to markdown☆77Updated 11 months ago
- ☆196Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆87Updated 8 months ago
- ☆52Updated 4 months ago
- ☆161Updated last week
- ☆78Updated last year
- Annoucing Instructor Cloud☆37Updated last year
- Claudette is Claude's friend☆260Updated last week
- ☆61Updated 2 weeks ago
- Tools to make language models a bit easier to use☆48Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 10 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- A framework for optimizing DSPy programs with RL☆150Updated this week
- ☆154Updated 3 weeks ago
- auto fine tune of models with synthetic data☆76Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆286Updated last month
- ☆135Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆99Updated last year
- ☆35Updated 3 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆193Updated last year
- converts url content into JSON with a simple prefix☆70Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆127Updated 10 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year