instructor-ai / evals
Evaluating the quality and capabilities of extractions and reasoning across various instructor clients.
☆50 Updated 5 months ago
Alternatives and similar repositories for evals:
Users who are interested in evals are comparing it to the libraries listed below.
- ☆58 Updated 4 months ago
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ☆80 Updated 3 months ago
- Verbosity control for AI agents ☆60 Updated 9 months ago
- Get a markdown version of any webpage with a keyboard shortcut. ☆60 Updated last month
- Announcing Instructor Cloud ☆34 Updated 7 months ago
- Tools to make language models a bit easier to use ☆39 Updated 2 weeks ago
- Chat Markup Language conversation library ☆55 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 2 weeks ago
- ☆77 Updated 9 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆90 Updated last month
- A framework for evaluating function calls made by LLMs ☆37 Updated 7 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated 11 months ago
- Deploy a FastHTML app in just a few lines of simple Python code on Modal's serverless infra. ☆26 Updated 7 months ago
- ☆28 Updated last week
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- Convert a web page to markdown ☆66 Updated 6 months ago
- Converts URL content into JSON with a simple prefix ☆67 Updated 10 months ago
- ☆76 Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 Updated last year
- Anthropic Computer Use with Modal Sandboxes ☆31 Updated 4 months ago
- Leverage your LangChain trace data for fine-tuning ☆41 Updated 7 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆124 Updated this week
- ☆48 Updated last year
- ☆75 Updated last year
- ☆31 Updated last year
- A couple of scripts to grab stats from email ☆42 Updated 6 months ago
- Routing on Random Forest (RoRF) ☆134 Updated 5 months ago
- ☆4 Updated 7 months ago
- Code interpreter support for o1 ☆32 Updated 6 months ago
- Automatic fine-tuning of models with synthetic data ☆74 Updated last year