instructor-ai / evals
Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.
☆50Updated 6 months ago
Alternatives and similar repositories for evals:
Users that are interested in evals are comparing it to the libraries listed below
- Annoucing Instructor Cloud☆34Updated 7 months ago
- ☆66Updated 5 months ago
- Verbosity control for AI agents☆61Updated 10 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆62Updated last month
- Chat Markup Language conversation library☆55Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 11 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆80Updated 4 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 8 months ago
- ☆77Updated 10 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last week
- A strongly typed Python DSL for developing message passing multi agent systems☆52Updated last year
- ☆78Updated 10 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Convert a web page to markdown☆69Updated 7 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- converts url content into JSON with a simple prefix☆67Updated 11 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆97Updated this week
- ☆48Updated last year
- Handout for a talk I gave about LLM and CLI tools☆62Updated 10 months ago
- ☆51Updated 10 months ago
- Leverage your LangChain trace data for fine tuning☆41Updated 8 months ago
- ☆28Updated 3 weeks ago
- A couple scripts to grab stats from email☆42Updated 7 months ago
- ☆47Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 8 months ago
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra.☆26Updated 7 months ago
- Tools to make language models a bit easier to use☆41Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- Routing on Random Forest (RoRF)☆139Updated 6 months ago