instructor-ai / evals
Evaluating the quality and capabilities of extractions and reasoning using various instructor clients.
☆51 Updated 7 months ago
Alternatives and similar repositories for evals:
Users who are interested in evals are comparing it to the libraries listed below.
- ☆67 Updated 5 months ago
- ☆31 Updated last week
- Verbosity control for AI agents ☆63 Updated 11 months ago
- Announcing Instructor Cloud ☆36 Updated 8 months ago
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ☆85 Updated 5 months ago
- Get a markdown version of any webpage with a keyboard shortcut. ☆63 Updated 2 months ago
- Converts URL content into JSON with a simple prefix ☆68 Updated 11 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated last month
- Chat Markup Language conversation library ☆55 Updated last year
- ☆4 Updated 8 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- A framework for evaluating function calls made by LLMs ☆37 Updated 9 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated last year
- Leverage your LangChain trace data for fine-tuning ☆41 Updated 9 months ago
- Convert a web page to markdown ☆70 Updated 8 months ago
- Minimal example of MCP for parsing llms.txt ☆37 Updated 3 weeks ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 9 months ago
- ☆30 Updated last month
- Automatic fine-tuning of models with synthetic data ☆75 Updated last year
- ☆77 Updated 11 months ago
- ☆78 Updated 11 months ago
- ☆32 Updated last year
- Logging and caching superpowers for the openai sdk ☆105 Updated last year
- ☆47 Updated 3 weeks ago
- Foyle is a copilot to help developers deploy and operate their applications. ☆127 Updated last month
- Website for Applied-LLMs work ☆26 Updated 2 weeks ago
- ☆47 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- ☆48 Updated last year