JonathanChavezTamales / LLMStats
A comprehensive set of LLM benchmark scores and provider prices.
☆182 Updated last month
Alternatives and similar repositories for LLMStats:
Users who are interested in LLMStats are comparing it to the repositories listed below.
- You don’t need to read the code to understand how to build! ☆185 Updated 3 months ago
- ☆98 Updated 3 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆429 Updated 6 months ago
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing ☆204 Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆233 Updated this week
- A timeline of notable generative AI events ☆84 Updated this week
- The easiest, and fastest way to run AI-generated Python code safely ☆297 Updated 4 months ago
- 🤖 Headless IDE for AI agents ☆181 Updated this week
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. ☆587 Updated last week
- Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine ☆542 Updated last week
- FastMLX is a high performance production ready API to host MLX models. ☆289 Updated last month
- Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your dev… ☆171 Updated last week
- ☆122 Updated last month
- An open-source implementation of Anthropic's Computer Use to perform basic tasks using AI Agents. ☆262 Updated 5 months ago
- A Multi-Agent AI Tool that creates beautiful presentations with voice-overs 🎦🔥 ☆163 Updated 2 months ago
- A simple Python program to implement the search-extract-summarize flow. ☆260 Updated 2 months ago
- LiveBench: A Challenging, Contamination-Free LLM Benchmark ☆655 Updated this week
- Letting Claude Code develop his own MCP tools :) ☆97 Updated last month
- ☆226 Updated 5 months ago
- ☆289 Updated last month
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application. ☆113 Updated last month
- Routing on Random Forest (RoRF) ☆141 Updated 6 months ago
- ☆183 Updated 4 months ago
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching" ☆47 Updated last week
- A flexible, adaptive classification system for dynamic text classification ☆154 Updated last month
- Tutorial for building LLM router ☆193 Updated 8 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆105 Updated 5 months ago
- PocketFlow's node-based workflow structure, with Manus' agents and tools! ☆178 Updated this week
- ☆153 Updated 9 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools ☆105 Updated 2 weeks ago