Kimi K2.6 is Moonshot AI's open-source flagship model, released in April 2026. It's a trillion-parameter mixture-of-experts system with 32B active parameters per token, a 256K context window, and leading scores on Humanity's Last Exam and SWE-Bench Pro. You can run it on Overchat AI alongside GPT-5.4, Claude Opus 4.6, and Gemini 3 Pro.

How do you access Kimi K2.6?

The quickest route is Overchat AI: go to www.overchat.ai, sign up for free if you want, and choose Kimi K2.6 from the model dropdown. You can also reach it on kimi.com, through Moonshot's own platform.moonshot.ai API, or by downloading the open weights from Hugging Face.

How do you use Kimi K2.6?

Open Overchat AI, pick Kimi K2.6 in the model selector, and type in what you need. Ask follow-ups, attach files, paste long documents, or kick off agent tasks that span many steps — the interface works just like any modern chatbot, so there's no ramp-up.

Kimi K2.6 was built by Moonshot AI, a Beijing-based AI lab founded in March 2023 by Yang Zhilin, Zhou Xinyu, and Wu Yuxin — all Tsinghua University alumni. Yang, the CEO, previously researched NLP at Meta and Google Brain and earned his PhD at Carnegie Mellon.

Kimi K2.7 Code

Kimi K2.7 — a coding-focused model in Moonshot AI's Kimi K2 family.

Kimi K2.7 Code is the most capable agentic coding model in Moonshot's lineup at a fraction of the price of Claude Opus 4.8 and GPT-5.5 — roughly 5–12× cheaper per token, with a 256K context window, a HighSpeed endpoint for latency-sensitive agentic loops, and open weights you can audit or self-host on your own hardware. For teams running high-volume coding pipelines, IDE assistants and long-horizon engineering tasks, the cost curve makes K2.7 Code the obvious default to put against Claude and GPT on your real workload before deciding.

Three steps to start a Kimi K2.7 Code session

Launch Overchat AI

Head to overchat.ai or install the mobile app, then pick Kimi K2.7 Code from the model menu.

Send your first prompt

Drop in a coding task, paste a repository or attach files — PDFs, DOCX, slide decks, images — and Kimi K2.7 Code will pull them into context and start working.

Let it run

Keep pushing tasks at it — refactors, research, drafts, migrations — and let its long-horizon agents carry the work.

Get Started

Why use Kimi K2.7 Code?

Kimi K2.7 Code is Moonshot AI's coding-focused open-weight release, launched on June 12, 2026 as the successor to K2.6. It belongs to the Kimi K2 family — a lineup of native multimodal mixture-of-experts models tuned for agentic coding, long-context reasoning and autonomous multi-step work. Overchat AI provides direct access without a Moonshot account or a Chinese phone number.

Moonshot AI, the team behind Kimi, was in Beijing in March 2023 by Yang Zhilin, Zhou Xinyu, and Wu Yuxin — all Tsinghua alumni. Yang earned his PhD at Carnegie Mellon and worked on NLP at Meta and Google Brain before co-founding the lab, which has since become one of China's most visible open-source AI labs.

Kimi K2.7 Code is Moonshot's strongest coding release to date, with substantial gains over K2.6 on long-horizon engineering tasks and agentic benchmarks — at a fraction of the per-token price of Claude Opus 4.8 and GPT-5.5.

What are the main features of Kimi K2.7 Code?

A 256K-token context window lets Kimi K2.7 Code reason over long codebases, multi-file pull requests or extended research archives in a single pass. Combined with multi-head latent attention and the K2-family mixture-of-experts routing, the model keeps inference costs in check even when the conversation stretches across hundreds of thousands of tokens.

Against the previous Kimi K2.5, the 2.6 release lands larger jumps on coding evals, longer autonomous runs, and a smoother agent swarm that scales further without drifting off-task.stronger reasoning, better factual accuracy, and noticeably improved writing quality.

Kimi K2.7 Code ships with open weights under a Modified MIT license, so you can pull the weights from Hugging Face and run Kimi K2.7 Code on your own hardware. If you'd rather skip the setup, Overchat AI lets you chat with it instantly — no Moonshot API key, no platform.moonshot.ai account, no China-region phone number required.

Moonshot ships Kimi K2.7 Code in two endpoints: the standard kimi-k2.7-code for full reasoning quality, and kimi-k2.7-code-highspeed for latency-sensitive workloads at roughly six times the token throughput. Both endpoints use the same underlying weights and both accept OpenAI- and Anthropic-compatible API calls.

What makes Moonshot stand out is how much it leans into safety and transparencyOn Moonshot's own evaluation suite, Kimi K2.7 Code posts +21.8% on Kimi Code Bench v2, +11.0% on Program Bench and +31.5% on MLS Bench Lite versus K2.6, plus 81.1 on MCP Mark Verified for correct tool invocation through the Model Context Protocol. Independent third-party numbers on SWE-bench Verified, SWE-bench Pro, Terminal-Bench and LiveCodeBench are not yet available, so vendor benchmarks should be treated as directional until reproduced.perfect for critical tasks where you need a strong coding partner you can audit end to end.

Kimi K2.7 Code Benchmars

Moonshot reports roughly +10% over K2.6, with notably more reliable instruction-following over long contexts and higher end-to-end completion rates on multi-step engineering tasks. Against Claude Opus 4.8 (SWE-bench Verified 88.6%) and GPT-5.5 (Terminal-Bench 82.7%), Kimi K2.7 Code's strongest lever is price-performance: roughly 5–12× cheaper per token, with comparable agentic completion rates on long-horizon coding tasks at a fraction of the latency thanks to the HighSpeed endpoint.

Ask Kimi K2.7 Code

Kimi K2.7 — a coding-focused model in Moonshot AI's Kimi K2 family.

What is Moonshot Kimi K2.7 Code?

Trillion-parameter MoE, 32B active

HighSpeed mode at ~6× throughput

30% fewer thinking tokens