simonw / llm-evals-plugin
Run evals using LLM
☆21Updated 10 months ago
Alternatives and similar repositories for llm-evals-plugin:
Users that are interested in llm-evals-plugin are comparing it to the libraries listed below
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆49Updated 5 months ago
- Handout for a talk I gave about LLM and CLI tools☆62Updated 9 months ago
- ☆58Updated 4 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆59Updated 3 weeks ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆80Updated 3 months ago
- ☆77Updated 9 months ago
- Run embedding models using ONNX☆31Updated last year
- Convert a web page to markdown☆66Updated 6 months ago
- applications of https://github.com/PrefectHQ/marvin☆12Updated last year
- Embedding models from Jina AI☆58Updated last year
- LLM plugin for clustering embeddings☆72Updated last year
- Chat Markup Language conversation library☆55Updated last year
- auto fine tune of models with synthetic data☆74Updated last year
- LLM plugin for models hosted on Replicate☆61Updated 10 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 7 months ago
- ☆31Updated last year
- Verbosity control for AI agents☆60Updated 9 months ago
- Tools for LLM agents.☆59Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- LLM plugin for embeddings using sentence-transformers☆52Updated last month
- converts url content into JSON with a simple prefix☆66Updated 10 months ago
- Python client for PromptWatch.io - LLM tracking platform☆28Updated 10 months ago
- Demos of ChatGPT's function calling/structured data support.☆23Updated last year
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra.☆26Updated 6 months ago
- sponge your gmail with artificial intelligence☆22Updated last month
- For LLMs to better code with Jina API☆134Updated last month
- Routing on Random Forest (RoRF)☆130Updated 5 months ago
- ☆47Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆59Updated 7 months ago
- Some tough questions to test new models.☆26Updated 10 months ago