instructor-ai / evals
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆48Updated 3 months ago
Alternatives and similar repositories for evals:
Users that are interested in evals are comparing it to the libraries listed below
- ☆56Updated 2 months ago
- Chat Markup Language conversation library☆55Updated last year
- Verbosity control for AI agents☆59Updated 7 months ago
- Convert a web page to markdown☆62Updated 4 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆46Updated 9 months ago
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- ☆73Updated this week
- A strongly typed Python DSL for developing message passing multi agent systems☆50Updated 9 months ago
- A framework for evaluating function calls made by LLMs☆36Updated 5 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆76Updated last month
- Tools to make language models a bit easier to use☆32Updated last month
- Annoucing Instructor Cloud☆34Updated 5 months ago
- ☆75Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆58Updated 6 months ago
- converts url content into JSON with a simple prefix☆64Updated 8 months ago
- ☆48Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆53Updated last month
- Using modal.com to process FineWeb-edu data☆19Updated last month
- ☆76Updated 7 months ago
- ☆77Updated 7 months ago
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆100Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆53Updated this week
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 8 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 9 months ago
- ☆18Updated 3 months ago
- Build reliable, secure, and production-ready AI apps easily.☆55Updated this week