MoonshotAI / K2-Vendor-Verifier
Verify the precision of all Kimi K2 API vendors
☆501 Updated last week
Alternatives and similar repositories for K2-Vendor-Verifier
Users who are interested in K2-Vendor-Verifier are comparing it to the libraries listed below.
- The LLM abstraction layer for modern AI agent applications. ☆499 Updated last week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆226 Updated 3 months ago
- ☆304 Updated 3 months ago
- Coding problems used in aider's polyglot benchmark ☆199 Updated last year
- A command-line interface tool for serving LLM using vLLM. ☆468 Updated last week
- Community maintained hardware plugin for vLLM on Apple Silicon ☆349 Updated last week
- Train Large Language Models on MLX. ☆245 Updated last week
- LLMProc: Unix-inspired runtime that treats LLMs as processes. ☆34 Updated 6 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work ☆282 Updated 3 weeks ago
- ☆264 Updated 2 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆424 Updated last week
- Provider-agnostic, open-source evaluation infrastructure for language models ☆714 Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆574 Updated 3 weeks ago
- Letting Claude Code develop his own MCP tools :) ☆123 Updated 10 months ago
- Proxy server that converts Anthropic API requests to OpenAI format and sends it to OpenRouter. It's used to use Claude Code with OpenRout… ☆395 Updated 9 months ago
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through… ☆185 Updated 7 months ago
- The State Of The Art, intelligence ☆157 Updated 5 months ago
- ☆135 Updated 9 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆148 Updated this week
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? ☆251 Updated 3 weeks ago
- The Open Deep Research app – generate reports with OSS LLMs ☆316 Updated last week
- ☆237 Updated 2 months ago
- Claude Deep Research config for Claude Code. ☆225 Updated 10 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers. ☆241 Updated 5 months ago
- Evolve your language agent with Agentic Context Engineering (ACE) ☆576 Updated 2 weeks ago
- REAP: Router-weighted Expert Activation Pruning for SMoE compression ☆222 Updated last month
- Prompt-to-Leaderboard ☆271 Updated 8 months ago
- ☆94 Updated 6 months ago
- ☆391 Updated 4 months ago
- Use Claude Code with any LLM provider - GLM-4.5, Kimi-K2, Qwen3-Coder, DeepSeek, etc. ☆378 Updated 4 months ago