JonathanChavezTamales / llm-leaderboardLinks
A comprehensive set of LLM benchmark scores and provider prices.
☆312Updated last week
Alternatives and similar repositories for llm-leaderboard
Users that are interested in llm-leaderboard are comparing it to the libraries listed below
Sorting:
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM generated content instantly.☆284Updated 3 months ago
- You don’t need to read the code to understand how to build!☆214Updated 8 months ago
- Provider-agnostic, open-source evaluation infrastructure for language models☆539Updated this week
- A timeline of notable generative AI events☆125Updated this week
- ☆327Updated 4 months ago
- Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo…☆238Updated 5 months ago
- An open-source dashboard for Cursor.sh IDE. Log AI code generations, track usage, and control AI models (including local ones). Run local…☆361Updated 10 months ago
- Claude Memory: Long-term memory for Claude☆575Updated 2 weeks ago
- Context infrastructure for AI agents☆341Updated this week
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol)☆331Updated 2 months ago
- Together Open Deep Research☆349Updated 5 months ago
- Smithery helps AI agents access external services via a unified gateway.☆278Updated this week
- A simple MCP integration that allows Claude to read and manage a personal Notion todo list☆202Updated 9 months ago
- Local Groq Desktop chat app with MCP support☆358Updated 2 weeks ago
- AI agents platform that gives you a workspace with an integrated team of personal assistants that can work behind the scenes to handle da…☆184Updated 2 months ago
- ☆221Updated 8 months ago
- ☆149Updated 4 months ago
- From Claude Artifact to deployable React app — in seconds!☆442Updated 2 weeks ago
- This framework works as a form of user/machine calibration, with a focus on user-context and user-intent, deconstructing your ideas logic…☆141Updated this week
- Model Context Protocol server implementation for Reddit☆194Updated 2 months ago
- Claude Deep Research config for Claude Code.☆217Updated 6 months ago
- A Model Context Protocol (MCP) server for research and documentation assistance using Perplexity AI. Won 1st @ Cline Hackathon☆249Updated 2 weeks ago
- aiformat is a simple tool you can use from the command line. It helps you select files and folders and change them into a format that AI …☆216Updated last year
- Learn hands on how to use the Windsurf Editor!☆191Updated last week
- Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your dev…☆183Updated 2 months ago
- An open-source implementation of Anthropic's Computer Use to perform basic tasks using AI Agents.☆290Updated 10 months ago
- The Open Deep Research app – generate reports with OSS LLMs☆299Updated 2 months ago
- Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine☆588Updated 4 months ago
- ☆188Updated 10 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆275Updated last month