medaks / medask-benchmarksLinks
A novel approach to evaluating AI agents on diagnostic accuracy in symptom checking tasks.
☆24Updated 2 months ago
Alternatives and similar repositories for medask-benchmarks
Users that are interested in medask-benchmarks are comparing it to the libraries listed below
Sorting:
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆33Updated 6 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆45Updated 3 weeks ago
- ☆30Updated 9 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆146Updated this week
- ☆49Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- ☆22Updated 3 months ago
- ☆23Updated 7 months ago
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆73Updated 3 weeks ago
- ☆115Updated 2 months ago
- Build Web Datasets with Ease☆33Updated last year
- ☆17Updated 9 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- "Just hoof it!" - A spotlight like interface to Ollama☆62Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆54Updated 11 months ago
- ☆107Updated 8 months ago
- AI agent web app platform☆68Updated this week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆52Updated 7 months ago
- ☆24Updated 8 months ago
- ☆17Updated last year
- Attend - to what matters.☆17Updated 7 months ago
- Finally, an open source Youtube Summarizer extension☆75Updated 5 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆80Updated last year
- Small tools to enhance your AI app with little effort.☆13Updated last year
- The library for character-driven AI experiences.☆88Updated last year
- Local drive deep search.☆33Updated 3 months ago
- Simple examples using Argilla tools to build AI☆55Updated 10 months ago
- Pivotal Token Search☆125Updated 2 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Updated 7 months ago