Ask Kimi K2.7 Code

Kimi K2.7 Code is a coding and software development AI model from Moonshot AI’s Kimi K2 family. Featuring a native multimodal mixture-of-experts architecture, the model can analyze text and images, perform advanced reasoning, and maintain context across up to 256K tokens. It activates 32 billion parameters per request from an underlying architecture of roughly 1 trillion parameters, enabling efficient performance on complex programming tasks.

Kimi K2.7 — a coding-focused model in Moonshot AI's Kimi K2 family.

Kimi K2.7 Code is the most capable agentic coding model in Moonshot's lineup at a fraction of the price of Claude Opus 4.8 and GPT-5.5 — roughly 5–12× cheaper per token, with a 256K context window, a HighSpeed endpoint for latency-sensitive agentic loops, and open weights you can audit or self-host on your own hardware. For teams running high-volume coding pipelines, IDE assistants and long-horizon engineering tasks, the cost curve makes K2.7 Code the obvious default to put against Claude and GPT on your real workload before deciding.

What is Moonshot Kimi K2.7 Code?

Kimi K2.7 Code is Moonshot AI's coding-focused open-weight flagship, released on June 12, 2026 as the successor to K2.6. It belongs to the Kimi K2 family — a lineup of native multimodal mixture-of-experts models tuned for agentic coding, long-context reasoning and autonomous multi-step work — and ships under a Modified MIT license, with weights publicly available on Hugging Face (moonshotai/Kimi-K2.7-Code) and a hosted API on platform.moonshot.ai compatible with both the OpenAI and Anthropic SDKs.

Minimalist UI illustration showing Overchat AI chat and document interface, with layered cards, message bubbles, and simplified icons in blue and white, representing AI-powered communication and content generation.
1️⃣

Trillion-parameter MoE, 32B active

Kimi K2.7 Code keeps the K2-family architecture — 1T total parameters, 32B activated per token — and inherits a 262,144-token (256K) context window, large enough to hold an entire repository, an extended specification, or a deep research dossier in view. The model always operates in thinking mode, but K2.7 Code reduces thinking-token usage by approximately 30% compared with K2.6, materially lowering output cost on agentic tasks without sacrificing reasoning depth.

🧠

HighSpeed mode at ~6× throughput

Moonshot ships a separate kimi-k2.7-code-highspeed endpoint that pushes around 180 output tokens per second, climbing to 260 in short-context runs — roughly six times the throughput of the standard endpoint. Same weights, same reasoning quality, with the latency profile you need for agentic loops and IDE-side completions.

♾️

30% fewer thinking tokens

Kimi K2.7 Code reduces thinking-token consumption by approximately 30% compared with K2.6 — the same reasoning behavior, with materially shorter chains of thought. On a multi-step coding task that previously burned through 50K output tokens, you now finish closer to 35K, which translates directly into a 30% cut to per-task cost on every agentic run.

Three steps to start a Kimi K2.7 Code session

1.

Launch Overchat AI

Head to overchat.ai or install the mobile app, then pick Kimi K2.7 Code from the model menu.

2.

Send your first prompt

Drop in a coding task, paste a repository or attach files — PDFs, DOCX, slide decks, images — and Kimi K2.7 Code will pull them into context and start working.

3.

Let it run

Keep pushing tasks at it — refactors, research, drafts, migrations — and let its long-horizon agents carry the work.

Get Started
Ask Kimi K2.7 Code on Overchat AI

Why use Kimi K2.7 Code?

Kimi K2.7 Code is Moonshot AI's coding-focused open-weight release, launched on June 12, 2026 as the successor to K2.6. It belongs to the Kimi K2 family — a lineup of native multimodal mixture-of-experts models tuned for agentic coding, long-context reasoning and autonomous multi-step work. Overchat AI provides direct access without a Moonshot account or a Chinese phone number.

Moonshot AI, the team behind Kimi, was in Beijing in March 2023 by Yang Zhilin, Zhou Xinyu, and Wu Yuxin — all Tsinghua alumni. Yang earned his PhD at Carnegie Mellon and worked on NLP at Meta and Google Brain before co-founding the lab, which has since become one of China's most visible open-source AI labs.

Kimi K2.7 Code is Moonshot's strongest coding release to date, with substantial gains over K2.6 on long-horizon engineering tasks and agentic benchmarks — at a fraction of the per-token price of Claude Opus 4.8 and GPT-5.5.

What are the main features of Kimi K2.7 Code?

A 256K-token context window lets Kimi K2.7 Code reason over long codebases, multi-file pull requests or extended research archives in a single pass. Combined with multi-head latent attention and the K2-family mixture-of-experts routing, the model keeps inference costs in check even when the conversation stretches across hundreds of thousands of tokens.

Against the previous Kimi K2.5, the 2.6 release lands larger jumps on coding evals, longer autonomous runs, and a smoother agent swarm that scales further without drifting off-task.stronger reasoning, better factual accuracy, and noticeably improved writing quality.

Kimi K2.7 Code ships with open weights under a Modified MIT license, so you can pull the weights from Hugging Face and run Kimi K2.7 Code on your own hardware. If you'd rather skip the setup, Overchat AI lets you chat with it instantly — no Moonshot API key, no platform.moonshot.ai account, no China-region phone number required.

Moonshot ships Kimi K2.7 Code in two endpoints: the standard kimi-k2.7-code for full reasoning quality, and kimi-k2.7-code-highspeed for latency-sensitive workloads at roughly six times the token throughput. Both endpoints use the same underlying weights and both accept OpenAI- and Anthropic-compatible API calls.

What makes Moonshot stand out is how much it leans into safety and transparencyOn Moonshot's own evaluation suite, Kimi K2.7 Code posts +21.8% on Kimi Code Bench v2, +11.0% on Program Bench and +31.5% on MLS Bench Lite versus K2.6, plus 81.1 on MCP Mark Verified for correct tool invocation through the Model Context Protocol. Independent third-party numbers on SWE-bench Verified, SWE-bench Pro, Terminal-Bench and LiveCodeBench are not yet available, so vendor benchmarks should be treated as directional until reproduced.perfect for critical tasks where you need a strong coding partner you can audit end to end.

Kimi K2.7 Code Benchmars

Moonshot reports roughly +10% over K2.6, with notably more reliable instruction-following over long contexts and higher end-to-end completion rates on multi-step engineering tasks. Against Claude Opus 4.8 (SWE-bench Verified 88.6%) and GPT-5.5 (Terminal-Bench 82.7%), Kimi K2.7 Code's strongest lever is price-performance: roughly 5–12× cheaper per token, with comparable agentic completion rates on long-horizon coding tasks at a fraction of the latency thanks to the HighSpeed endpoint.

FAQ

What is Kimi K2.7 Code?

Kimi K2.7 Code is Moonshot AI's coding-focused open-weight flagship, released on June 12, 2026 as the successor to Kimi K2.6. It uses a 1-trillion-parameter mixture-of-experts architecture that activates 32 billion parameters per request, supports a 262,144-token (256K) context window, and always operates in thinking mode — with thinking-token usage cut by roughly 30% versus K2.6. You can run it on Overchat AI alongside Claude Opus 4.8, GPT-5.5 and Gemini 3 Pro.

Where can I access Kimi K2.7 Code?

The quickest route is Overchat AI: open www.overchat.ai, sign up, and choose Kimi K2.7 Code from the model dropdown. You can also reach it through Moonshot's own platform.moonshot.ai API (OpenAI- and Anthropic-compatible), via partners like OpenRouter, Fireworks AI and Unsloth, or by downloading the open weights from Hugging Face (moonshotai/Kimi-K2.7-Code) and running it on your own hardware.

How much does Kimi K2.7 Code cost?

Moonshot's official pricing for Kimi K2.7 Code is roughly $0.75 per million input tokens and $3.50 per million output tokens, with cache-hit reads at $0.19 per million tokens — partner pricing on OpenRouter and Fireworks sits a touch higher at about $0.95 / $4.00. That undercuts Claude Opus 4.8 ($5 / $25) and GPT-5.5 ($5 / $30) by roughly 5–12× per token, which is the headline reason teams put it in their stack for high-volume coding workloads.

What is Kimi K2.7 Code HighSpeed mode?

HighSpeed (kimi-k2.7-code-highspeed) is a separate endpoint Moonshot announced on June 15, 2026 — three days after the K2.7 Code release. It delivers approximately 180 output tokens per second, climbing to 260 in short-context scenarios, which is roughly six times faster than the standard kimi-k2.7-code endpoint. Both endpoints share the same underlying weights, so HighSpeed is the obvious pick for latency-sensitive agentic workflows where end-to-end completion time matters more than peak reasoning depth.

How does Kimi K2.7 Code compare to Claude Opus 4.8 and GPT-5.5?

Kimi K2.7 Code's strongest selling point against Claude Opus 4.8 and GPT-5.5 is price — it's roughly 5–12× cheaper per token — and agentic capability, where Moonshot's own numbers show meaningful gains over K2.6. On independent coding benchmarks (SWE-bench Verified, Terminal-Bench, LiveCodeBench) Claude Opus 4.8 and GPT-5.5 still lead because no third-party numbers for K2.7 have been published yet. For high-volume agentic coding pipelines the price-performance is hard to beat; for one-shot top-of-the-leaderboard reasoning, Claude and GPT remain the safer call.

Is Kimi K2.7 Code open-source?

Yes. Moonshot AI ships Kimi K2.7 Code as open-weight under a Modified MIT license, with the weights publicly available on Hugging Face (moonshotai/Kimi-K2.7-Code). You can inspect, fine-tune and self-host the model on your own hardware, or use it through Moonshot's hosted platform.moonshot.ai API, OpenRouter, Fireworks AI, Unsloth, or Overchat AI — whichever route fits your stack.

Are the Kimi K2.7 Code benchmarks reliable?

Mixed. All published Kimi K2.7 Code scores at release are Moonshot's own — +21.8% on Kimi Code Bench v2, +11% on Program Bench, +31.5% on MLS Bench Lite, 81.1 on MCP Mark Verified — and Moonshot did not submit K2.7 to SWE-bench Verified, SWE-bench Pro, Terminal-Bench or LiveCodeBench, which several practitioners flagged after launch. Treat the numbers as directional, run K2.7 Code on your own coding workload, and compare it against K2.6, Claude Opus 4.8 and GPT-5.5 on your tasks before committing it to production.

Explore more AI models

Chat GPT Logo

GPT-5.5

OpenAI's most advanced model with exceptional reasoning, creativity, and multimodal capabilities.

Ask GPT ↗
DeepSeek logo

DeepSeek V4

Advanced reasoning model designed for complex problem solving, mathematical reasoning, and programming.

Ask DeepSeek ↗
Claude logo

Claude Opus 4.8

Anthropic's flagship model excelling at reasoning, knowledge, math, and coding tasks.

Ask Claude ↗
Gemini Logo

Gemini 3.5

Google's most capable model with advanced multimodal understanding and generation.

Ask Gemini ↗
Grok logo

Grok 4.3

xAI's powerful model with real-time knowledge and witty, direct responses.

Ask Grok ↗
Qwen logo

Qwen 3.7

Alibaba's advanced model with strong multilingual capabilities and reasoning skills.

Ask Qwen ↗

About Overchat AI

Overchat AI brings you the power of the world's top AI models: ChatGPT, Claude, Gemini, Mistral, and more.

Overchat AI Interface

From The Blog

Overchat AI For All Platforms

Available on Web, iOS, and Android. Access your AI assistant anywhere, anytime.

Google Play Store badgeApp Store badge
Overchat AI Desktop and mobile interfaces