MoonshotAI / K2-Vendor-Verifier
Verify the precision of all Kimi K2 API vendors
☆494 Updated last week
Alternatives and similar repositories for K2-Vendor-Verifier
Users who are interested in K2-Vendor-Verifier are comparing it to the libraries listed below.
- The LLM abstraction layer for modern AI agent applications. ☆496 Updated this week
- Train Large Language Models on MLX. ☆240 Updated this week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆226 Updated 2 months ago
- Community maintained hardware plugin for vLLM on Apple Silicon ☆217 Updated this week
- ☆304 Updated 2 months ago
- Coding problems used in aider's polyglot benchmark ☆199 Updated last year
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆435 Updated last week
- A command-line interface tool for serving LLM using vLLM. ☆461 Updated last month
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆404 Updated 2 weeks ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work ☆279 Updated last week
- ☆236 Updated last month
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? ☆240 Updated last week
- Distributed inference for MLX LLMs ☆100 Updated last year
- Letting Claude Code develop his own MCP tools :) ☆122 Updated 10 months ago
- LLMProc: Unix-inspired runtime that treats LLMs as processes. ☆34 Updated 5 months ago
- Claude Deep Research config for Claude Code. ☆224 Updated 9 months ago
- Proxy server that converts Anthropic API requests to OpenAI format and sends it to OpenRouter. It's used to use Claude Code with OpenRout… ☆394 Updated 8 months ago
- Provider-agnostic, open-source evaluation infrastructure for language models ☆705 Updated 3 weeks ago
- Force DeepSeek r1 models to think for as long as you wish ☆373 Updated 10 months ago
- proof-of-concept of Cursor's Instant Apply feature ☆88 Updated last year
- Routing on Random Forest (RoRF) ☆238 Updated last year
- ☆263 Updated 2 months ago
- Prompt-to-Leaderboard ☆271 Updated 8 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers. ☆241 Updated 5 months ago
- Agent computer interface for AI software engineer. ☆114 Updated last month
- A clean, modular SDK for building AI agents with OpenHands V1. ☆412 Updated this week
- ☆916 Updated this week
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆134 Updated this week
- Harbor is a framework for running agent evaluations and creating and using RL environments. ☆381 Updated this week
- ☆107 Updated 2 months ago