jayminban / 41-llms-evaluated-on-19-benchmarksLinks
This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
☆72Updated 3 weeks ago
Alternatives and similar repositories for 41-llms-evaluated-on-19-benchmarks
Users that are interested in 41-llms-evaluated-on-19-benchmarks are comparing it to the libraries listed below
Sorting:
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆28Updated last month
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated last year
- Self-hosted AI medical scribe.☆50Updated last week
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆275Updated last month
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆126Updated 11 months ago
- ☆165Updated last month
- Enhancing LLMs with LoRA☆137Updated 2 weeks ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆100Updated last month
- ☆209Updated 2 weeks ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 4 months ago
- ☆132Updated 5 months ago
- Convert URLs into LLM-friendly markdown chunks☆65Updated last year
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 7 months ago
- ☆48Updated 6 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated 8 months ago
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support☆224Updated last month
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆146Updated 3 months ago
- A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equat…☆154Updated 2 months ago
- Analyze Reddit posts☆25Updated 6 months ago
- InferX: Inference as a Service Platform☆135Updated this week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆45Updated 3 weeks ago
- The Fastest Way to Fine-Tune LLMs Locally☆321Updated 6 months ago
- ☆53Updated 7 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆95Updated 2 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆52Updated 7 months ago
- A frontend for creative writing with LLMs☆134Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Open source LLM UI, compatible with all local LLM providers.☆174Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆82Updated last week