simonw / llm-evals-pluginLinks
Run evals using LLM
☆26Updated last year
Alternatives and similar repositories for llm-evals-plugin
Users that are interested in llm-evals-plugin are comparing it to the libraries listed below
Sorting:
- Handout for a talk I gave about LLM and CLI tools☆62Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆89Updated 11 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 9 months ago
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- ☆84Updated last year
- Minimal example of MCP for parsing llms.txt☆40Updated 7 months ago
- ☆77Updated last year
- Convert a web page to markdown☆80Updated last year
- https://verdad.app☆83Updated this week
- LLM plugin for models hosted on Replicate☆64Updated last year
- The LLM plugins directory☆44Updated 2 years ago
- LLM plugin for embeddings using sentence-transformers☆72Updated 6 months ago
- Widgets to make it easy to add labels☆37Updated 3 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆80Updated 9 months ago
- ☆63Updated 3 months ago
- Verbosity control for AI agents☆64Updated last year
- LLM plugin providing access to Mistral models using the Mistral API☆200Updated 3 months ago
- ☆172Updated 3 weeks ago
- 🧡 Hacker News summaries☆21Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆106Updated 2 months ago
- Claudette is Claude's friend☆295Updated 2 weeks ago
- Import unstructured data (text and images) into structured tables☆157Updated last week
- ☆21Updated last year
- Demos of ChatGPT's function calling/structured data support.☆24Updated last year
- Embedding models from Jina AI☆65Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 8 months ago
- Tools to make language models a bit easier to use☆58Updated last month
- Chat Markup Language conversation library☆55Updated last year