MoonshotAI / K2-Vendor-VerifierLinks
Verify Precision of all Kimi K2 API Vendor
☆513Updated 2 weeks ago
Alternatives and similar repositories for K2-Vendor-Verifier
Users that are interested in K2-Vendor-Verifier are comparing it to the libraries listed below
Sorting:
- The LLM abstraction layer for modern AI agent applications.☆507Updated last week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆228Updated 3 months ago
- Community maintained hardware plugin for vLLM on Apple Silicon☆400Updated last week
- A command-line interface tool for serving LLM using vLLM.☆471Updated 2 weeks ago
- Coding problems used in aider's polyglot benchmark☆199Updated last year
- Train Large Language Models on MLX.☆258Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆430Updated last week
- ☆308Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆636Updated last month
- ☆265Updated 3 months ago
- Provider-agnostic, open-source evaluation infrastructure for language models☆719Updated last month
- Force DeepSeek r1 models to think for as long as you wish☆373Updated 11 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆259Updated last month
- Use Claude Code with any LLM provider - GLM-4.5, Kimi-K2, Qwen3-Coder, DeepSeek, etc.☆378Updated 4 months ago
- Claude Deep Research config for Claude Code.☆226Updated 10 months ago
- Prompt-to-Leaderboard☆271Updated 9 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆282Updated last month
- The Open Deep Research app – generate reports with OSS LLMs☆316Updated 2 weeks ago
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆34Updated 6 months ago
- The State Of The Art, intelligence☆157Updated 6 months ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 11 months ago
- ☆238Updated 2 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆225Updated 5 months ago
- ☆757Updated last week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆902Updated last week
- ☆94Updated 7 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- ☆135Updated 9 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆573Updated 2 months ago
- A tool to use the Ai2 Open Coding Agents Soft-Verified Efficient Repository Agents (SERA) model with Claude Code☆220Updated last week