Vision-Empower / Kimi-K2-Mini

A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.

☆35

Alternatives and similar repositories for Kimi-K2-Mini

Users who are interested in Kimi-K2-Mini compare it to the libraries listed below.

jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆42 Updated last month
deepgrove-ai / Bonsai
☆31 Updated 5 months ago
shirley-wu / cot_decoding
☆45 Updated last year
bradhilton / temporal-clue
Clue-inspired puzzles for testing LLM deduction abilities
☆40 Updated 5 months ago
InternLM / SWE-Fixer
☆113 Updated 3 months ago
microsoft / GRIN-MoE
GRadient-INformed MoE
☆264 Updated 11 months ago
pseudotensor / open-strawberry
Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…
☆183 Updated 10 months ago
SWE-bench / SWE-smith
Scaling Data for SWE-agents
☆386 Updated last week
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58-bit language models
☆85 Updated 3 months ago
Arvid-pku / Godel_Agent
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
☆129 Updated 6 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆109 Updated 4 months ago
rodrigobaron / anthill
☆24 Updated 7 months ago
agokrani / distillKitPlus
Easy-to-use, high-performance knowledge distillation for LLMs
☆92 Updated 3 months ago
ZhangYiqun018 / AvengersPro
☆134 Updated last week
WhitzardIndex / self-replication-research
A preprint version of our recent research on the capability of frontier AI systems to self-replicate
☆59 Updated 8 months ago
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆121 Updated 3 weeks ago
chigkim / Ollama-MMLU-Pro
☆96 Updated last week
bigcode-project / selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆310 Updated 6 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆146 Updated 6 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173 Updated 7 months ago
calmstate / VisualTagger
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆10 Updated 10 months ago
Aider-AI / polyglot-benchmark
Coding problems used in aider's polyglot benchmark
☆175 Updated 8 months ago
uukuguy / speechless
LLM-based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
☆106 Updated last month
YerbaPage / MGDebugger
Multi-Granularity LLM Debugger
☆89 Updated last month
reka-ai / rekaquant
☆60 Updated last month
NL2Code / CodeR
☆159 Updated last year
codelion / pts
Pivotal Token Search
☆123 Updated last month
menloresearch / ReZero
☆154 Updated 4 months ago
lmarena / p2l
Prompt-to-Leaderboard
☆250 Updated 3 months ago
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆574 Updated 3 weeks ago