instructor-ai / evalsLinks
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆52Updated 11 months ago
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- Verbosity control for AI agents☆65Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆65Updated 6 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆290Updated 2 weeks ago
- ☆82Updated 10 months ago
- ☆196Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆87Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- Convert a web page to markdown☆78Updated last year
- ☆170Updated last week
- Foyle is a copilot to help developers deploy and operate their applications.☆131Updated 5 months ago
- ☆78Updated last year
- Annoucing Instructor Cloud☆37Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 11 months ago
- ☆52Updated 4 months ago
- auto fine tune of models with synthetic data☆76Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- A framework for optimizing DSPy programs with RL☆154Updated last week
- ☆135Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- ☆72Updated this week
- ☆35Updated 4 months ago
- Claudette is Claude's friend☆266Updated this week
- ☆166Updated this week
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆194Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆311Updated 2 months ago
- converts url content into JSON with a simple prefix☆71Updated last year
- ☆54Updated last year
- Tools to make language models a bit easier to use☆50Updated last month