medaks / medask-benchmarksLinks
A novel approach to evaluating AI agents on diagnostic accuracy in symptom checking tasks.
☆25Updated 7 months ago
Alternatives and similar repositories for medask-benchmarks
Users that are interested in medask-benchmarks are comparing it to the libraries listed below
Sorting:
- ☆30Updated last year
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆35Updated 10 months ago
- ☆121Updated 6 months ago
- ☆21Updated 8 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- Build Web Datasets with Ease☆33Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆63Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆49Updated 5 months ago
- ☆107Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- ☆108Updated last year
- Pivotal Token Search☆144Updated last month
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- ☆37Updated 6 months ago
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆36Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆56Updated 11 months ago
- ☆17Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆193Updated this week
- ☆50Updated last year
- ☆47Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆221Updated last month
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated last year
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆128Updated this week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week