Vision-Empower / Kimi-K2-Mini
A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.
☆35 Updated last month
Alternatives and similar repositories for Kimi-K2-Mini
Users who are interested in Kimi-K2-Mini are comparing it to the repositories listed below.
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆42 Updated last month
- ☆31 Updated 5 months ago
- ☆45 Updated last year
- Clue inspired puzzles for testing LLM deduction abilities ☆40 Updated 5 months ago
- ☆113 Updated 3 months ago
- GRadient-INformed MoE ☆264 Updated 11 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging… ☆183 Updated 10 months ago
- Scaling Data for SWE-agents ☆386 Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆85 Updated 3 months ago
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement ☆129 Updated 6 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆109 Updated 4 months ago
- ☆24 Updated 7 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆92 Updated 3 months ago
- ☆134 Updated last week
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆59 Updated 8 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆121 Updated 3 weeks ago
- ☆96 Updated last week
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆310 Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆146 Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆173 Updated 7 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa… ☆10 Updated 10 months ago
- Coding problems used in aider's polyglot benchmark ☆175 Updated 8 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆106 Updated last month
- Multi-Granularity LLM Debugger ☆89 Updated last month
- ☆60 Updated last month
- ☆159 Updated last year
- Pivotal Token Search ☆123 Updated last month
- ☆154 Updated 4 months ago
- Prompt-to-Leaderboard ☆250 Updated 3 months ago
- Scaling RL on advanced reasoning models ☆574 Updated 3 weeks ago