MoonshotAI / K2-Vendor-VerifierLinks
Verify Precision of all Kimi K2 API Vendor
☆442Updated last week
Alternatives and similar repositories for K2-Vendor-Verifier
Users that are interested in K2-Vendor-Verifier are comparing it to the libraries listed below
Sorting:
- The LLM abstraction layer for modern AI agent applications.☆464Updated this week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆210Updated last month
- Train Large Language Models on MLX.☆223Updated this week
- ☆289Updated last month
- Coding problems used in aider's polyglot benchmark☆193Updated 11 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆273Updated last month
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆220Updated last week
- ☆234Updated 4 months ago
- A command-line interface tool for serving LLM using vLLM.☆445Updated last month
- Distributed Inference for mlx LLm☆99Updated last year
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆278Updated 3 months ago
- Prompt-to-Leaderboard☆265Updated 6 months ago
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆33Updated 4 months ago
- Library for model distillation☆158Updated 2 months ago
- AI benchmark runtime framework that allows you to integrate and evaluate AI tasks using Docker-based benchmarks.☆165Updated 6 months ago
- ☆261Updated 3 weeks ago
- ☆135Updated 7 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆382Updated this week
- Provider-agnostic, open-source evaluation infrastructure for language models☆667Updated this week
- Claude Deep Research config for Claude Code.☆223Updated 8 months ago
- proof-of-concept of Cursor's Instant Apply feature☆85Updated last year
- LLM inference in C/C++☆103Updated this week
- The Open Deep Research app – generate reports with OSS LLMs☆311Updated last week
- Force DeepSeek r1 models to think for as long as you wish☆372Updated 9 months ago
- ☆94Updated 4 months ago
- Train your own SOTA deductive reasoning model☆107Updated 8 months ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 8 months ago
- ☆68Updated 6 months ago
- LLM-as-SERP☆69Updated 8 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆205Updated 3 months ago