MoonshotAI / Kimi-K2Links
Kimi K2 is the large language model series developed by Moonshot AI team
☆1,850Updated this week
Alternatives and similar repositories for Kimi-K2
Users that are interested in Kimi-K2 are comparing it to the libraries listed below
Sorting:
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆2,631Updated this week
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆655Updated this week
- Big & Small LLMs working together☆1,058Updated this week
- Releases from OpenAI Preparedness☆792Updated last month
- open-source coding LLM for software engineering tasks☆726Updated 2 weeks ago
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…☆1,427Updated last month
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆798Updated last month
- ☆1,166Updated 2 months ago
- Lightweight coding agent that runs in your terminal☆1,900Updated 2 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,501Updated last month
- Training Large Language Model to Reason in a Continuous Latent Space☆1,185Updated 5 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,123Updated 5 months ago
- ☆3,392Updated 4 months ago
- Dream 7B, a large diffusion language model☆816Updated 3 weeks ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆527Updated last month
- Democratizing Reinforcement Learning for LLMs☆3,744Updated this week
- Muon is Scalable for LLM Training☆1,093Updated 3 months ago
- Official Repository of Absolute Zero Reasoner☆1,593Updated last week
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆544Updated this week
- The #1 open-source SWE-bench Verified implementation☆758Updated last month
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆1,108Updated this week
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,817Updated 3 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆577Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,434Updated this week
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆1,136Updated this week
- Exa is a Web Search API | This is Exa MCP (Model Context Protocol)☆1,838Updated last week
- MemOS (Preview) | Intelligence Begins with Memory☆638Updated this week
- Run LLMs with MLX☆1,276Updated this week
- Sky-T1: Train your own O1 preview model within $450☆3,300Updated this week
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.☆1,000Updated last week