instructor-ai / evalsLinks
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆52Updated last year
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below
Sorting:
- Verbosity control for AI agents☆64Updated last year
- ☆84Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 9 months ago
- Annoucing Instructor Cloud☆38Updated last year
- ☆78Updated last year
- Convert a web page to markdown☆80Updated last year
- ☆46Updated this week
- ☆197Updated last week
- ☆198Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- ☆36Updated 6 months ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆106Updated 2 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆89Updated last year
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆322Updated 2 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆197Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆154Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆101Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- auto fine tune of models with synthetic data☆76Updated last year
- ☆172Updated last month
- ☆53Updated 7 months ago
- ☆45Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 7 months ago
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆368Updated 2 months ago
- Minimal example of MCP for parsing llms.txt☆40Updated 7 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- converts url content into JSON with a simple prefix☆71Updated last year
- ☆159Updated 11 months ago