{"id":1179,"date":"2026-06-19T16:39:14","date_gmt":"2026-06-19T16:39:14","guid":{"rendered":"https:\/\/convly.ai\/kimi-k2-7-code-explained-2026\/"},"modified":"2026-06-19T16:39:32","modified_gmt":"2026-06-19T16:39:32","slug":"kimi-k2-7-code-explained-2026","status":"publish","type":"post","link":"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/","title":{"rendered":"Kimi K2.7 Code Explained: Moonshot&#8217;s Open 1T Coding Model"},"content":{"rendered":"<p>Moonshot AI released Kimi K2.7 Code on June 12, 2026, and the name matters more than usual. This is not a new general chatbot called &#8220;Kimi 2.7.&#8221; It is a coding-only model: a 1-trillion-parameter Mixture-of-Experts system tuned specifically to plan, edit files, run tools, and grind through multi-step software tasks. For ordinary chat, Moonshot still points you at the older K2.6.<\/p>\n<p>The pitch is efficiency. K2.7 Code claims higher coding scores than K2.6 while burning roughly 30% fewer reasoning tokens, and it lists at $0.95 per million input tokens and $4.00 per million output tokens. That is a fraction of what frontier closed models charge. The weights are public under a Modified MIT license, so you can also run it yourself \u2014 if you have the hardware for a model that lands at about 595 GB on disk even in its native 4-bit form. 
Here is what is real, what is vendor-reported, and where it fits.<\/p>\n<div class=\"convly-tldr\">\n<h3>Key takeaways<\/h3>\n<ul>\n<li><strong>Coding-only, not a chatbot.<\/strong> &#8220;K2.7 Code&#8221; is a dedicated agentic coding model; Moonshot recommends K2.6 for general use.<\/li>\n<li><strong>1T MoE, 32B active.<\/strong> 384 experts, with 8 routed + 1 shared active per token, 61 layers, 256K context, 160K vocab, MLA attention, plus a 400M-param MoonViT vision encoder for image and video input.<\/li>\n<li><strong>Thinking is mandatory.<\/strong> There is no non-thinking mode; disabling it returns an API error.<\/li>\n<li><strong>Vendor-reported gains over K2.6:<\/strong> +21.8% Kimi Code Bench v2, +11.0% Program Bench, +31.5% MLS Bench Lite, with ~30% fewer reasoning tokens.<\/li>\n<li><strong>Aggressive pricing:<\/strong> $0.95 in \/ $4.00 out per million tokens, with cache hits near $0.19 \u2014 roughly 6x under Claude Opus 4.8 and up to ~12x under Claude Fable 5 on output.<\/li>\n<li><strong>Open weights, heavy hardware.<\/strong> Modified MIT license on Hugging Face; the weights ship natively in int4 (~595 GB), and realistic local inference still needs roughly eight 80GB-class GPUs (~640 GB VRAM).<\/li>\n<\/ul>\n<\/div>\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-flat ez-toc-counter ez-toc-container-direction\">\n<label for=\"ez-toc-cssicon-toggle-item-6a35faa42d496\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #000000;color:#000000\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #000000;color:#000000\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" 
width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a35faa42d496\"  aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#What_Kimi_K27_Code_actually_is\" >What Kimi K2.7 Code actually is<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#Specs_and_architecture\" >Specs and architecture<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#The_benchmark_gains_read_honestly\" >The benchmark gains, read honestly<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#Pricing_and_value\" >Pricing and value<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#How_to_use_it_API_vs_running_the_weights\" >How to use it: API vs running the weights<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#How_it_compares_to_K26_and_rivals\" >How it compares to K2.6 and rivals<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#FAQ\" >FAQ<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-8\" 
href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#Bottom_line\" >Bottom line<\/a><\/li><li class='ez-toc-page-1'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/convly.ai\/fr\/kimi-k2-7-code-explained-2026\/#Related_articles\" >Related articles<\/a><\/li><\/ul><\/nav><\/div>\n<h2><span class=\"ez-toc-section\" id=\"What_Kimi_K27_Code_actually_is\"><\/span>What Kimi K2.7 Code actually is<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>K2.7 Code is the latest in Moonshot&#8217;s fast-moving Kimi line, and it is the first the company has split off as a coding-specialized release rather than a general model with a coding mode. The design goal is long-horizon software engineering: the kind of work where an agent reads a repo, plans a change, edits several files, runs a build, reads the error, and iterates. It is built to act, not to converse.<\/p>\n<p>That focus shows up in the defaults. The model always runs with &#8220;thinking&#8221; enabled \u2014 there is no way to turn it off, and the API rejects requests that try. The bet is that for agentic coding, the reasoning traces are worth their cost, and that K2.7&#8217;s efficiency gains keep that cost in check. If you want a model that just answers a quick question cheaply, Moonshot itself says to use K2.6 instead. We cover the broader family in our <a href=\"\/fr\/moonshot-kimi-explained-2026\/\">Moonshot Kimi explainer<\/a>.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Specs_and_architecture\"><\/span>Specs and architecture<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The architecture is a sparse MoE. 
Of the 1 trillion total parameters, only about 32 billion activate per token, which is what keeps inference cost and latency far below what a 1T dense model would imply.<\/p>\n<table class=\"convly-vs\">\n<thead>\n<tr>\n<th>Spec<\/th>\n<th>Kimi K2.7 Code<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Total parameters<\/td>\n<td>1 trillion (MoE)<\/td>\n<\/tr>\n<tr>\n<td>Active per token<\/td>\n<td>~32 billion<\/td>\n<\/tr>\n<tr>\n<td>Experts<\/td>\n<td>384 (8 routed + 1 shared active per token)<\/td>\n<\/tr>\n<tr>\n<td>Layers<\/td>\n<td>61 (1 dense)<\/td>\n<\/tr>\n<tr>\n<td>Context window<\/td>\n<td>256K tokens (262,144)<\/td>\n<\/tr>\n<tr>\n<td>Vocabulary<\/td>\n<td>160K<\/td>\n<\/tr>\n<tr>\n<td>Attention<\/td>\n<td>MLA (Multi-head Latent Attention)<\/td>\n<\/tr>\n<tr>\n<td>Modality<\/td>\n<td>Text, image, video (via 400M MoonViT encoder)<\/td>\n<\/tr>\n<tr>\n<td>Native precision<\/td>\n<td>INT4 (MoE weights), BF16 attention<\/td>\n<\/tr>\n<tr>\n<td>Thinking mode<\/td>\n<td>Mandatory (cannot disable)<\/td>\n<\/tr>\n<tr>\n<td>License<\/td>\n<td>Modified MIT (open weights)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The native multimodal input is a genuine differentiator for a coding model. You can hand it a screenshot of a broken UI, a diagram, or a short screen recording alongside the code. Most coding-focused open models are text-only, so this widens the practical use cases \u2014 debugging from a screenshot, implementing from a mockup \u2014 without a separate vision pipeline.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"The_benchmark_gains_read_honestly\"><\/span>The benchmark gains, read honestly<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Moonshot&#8217;s headline numbers compare K2.7 Code to K2.6 on its own internal suites. 
These are vendor-reported and use Moonshot&#8217;s benchmarks, so treat them as directional rather than neutral ground truth.<\/p>\n<table class=\"convly-vs\">\n<thead>\n<tr>\n<th>Benchmark (vendor-reported)<\/th>\n<th>K2.6<\/th>\n<th>K2.7 Code<\/th>\n<th>Change<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Kimi Code Bench v2<\/td>\n<td>50.9<\/td>\n<td>62.0<\/td>\n<td>+21.8%<\/td>\n<\/tr>\n<tr>\n<td>Program Bench<\/td>\n<td>48.3<\/td>\n<td>53.6<\/td>\n<td>+11.0%<\/td>\n<\/tr>\n<tr>\n<td>MLS Bench Lite<\/td>\n<td>26.7<\/td>\n<td>35.1<\/td>\n<td>+31.5%<\/td>\n<\/tr>\n<tr>\n<td>MCPMark Verified<\/td>\n<td>72.8<\/td>\n<td>81.1<\/td>\n<td>+11.4%<\/td>\n<\/tr>\n<tr>\n<td>Reasoning tokens used<\/td>\n<td>baseline<\/td>\n<td>~30% fewer<\/td>\n<td>more efficient<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>On agent-tool benchmarks (MCP Atlas, MCPMark Verified, Kimi&#8217;s own Claw 24\/7), Moonshot reports gains of roughly 10% over K2.6 \u2014 smaller, but in the right direction.<\/p>\n<p>Independent data is starting to land. Artificial Analysis, which runs its own measurements rather than republishing vendor claims, places K2.7 Code at 42 on its composite Intelligence Index, ranking it around #6 among the open-weight models it tracks. It clocks output at about 55.8 tokens per second with a ~2.25-second time to first token on Moonshot&#8217;s standard API \u2014 respectable, not record-setting, and the mandatory thinking mode means real-world latency on a full agent task is higher than that first-token number suggests. (Moonshot also offers a separate high-speed endpoint that runs far faster, but the headline model is the one benchmarked here.)<\/p>\n<p>The most useful third-party comparison comes from head-to-head coding tests. On MCPMark Verified, an agent-tool benchmark, K2.7 Code scores 81.1, edging out Claude Opus 4.8 at 76.4 \u2014 but GPT-5.5 sits well ahead at 92.9. On Moonshot&#8217;s own Program Bench, GPT-5.5 leads 69.1 to 53.6. 
The honest summary: K2.7 Code is competitive with frontier models on some agentic-tool tasks and clearly behind on others. It is not the new state of the art. Its case rests on price.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Pricing_and_value\"><\/span>Pricing and value<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>This is where K2.7 Code makes noise. Here is the published API pricing against the current closed frontier, per million tokens.<\/p>\n<table class=\"convly-vs\">\n<thead>\n<tr>\n<th>Model<\/th>\n<th>Input<\/th>\n<th>Output<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Kimi K2.7 Code<\/td>\n<td>$0.95<\/td>\n<td>$4.00<\/td>\n<\/tr>\n<tr>\n<td>Claude Opus 4.8<\/td>\n<td>$5.00<\/td>\n<td>$25.00<\/td>\n<\/tr>\n<tr>\n<td>GPT-5.5<\/td>\n<td>$5.00<\/td>\n<td>$30.00<\/td>\n<\/tr>\n<tr>\n<td>Claude Fable 5<\/td>\n<td>$10.00<\/td>\n<td>$50.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>On output, K2.7 Code is roughly 6x cheaper than Opus 4.8 and more than 12x cheaper than Fable 5. Cache hits cost around $0.19 per million input tokens, which matters a lot for agents that re-read the same files repeatedly. Combine that with ~30% fewer reasoning tokens per task, and the effective cost gap widens further.<\/p>\n<p>The trade is straightforward: lower raw capability per call, but the same budget buys many more calls. For high-volume agentic workloads \u2014 CI bots, bulk refactors, test generation, automated triage \u2014 running K2.7 Code several times and keeping the best result can beat one expensive frontier call. For a single, subtle architectural decision, the frontier model&#8217;s higher hit rate may still be worth the premium. 
If you are weighing options across the field, our roundup of the <a href=\"\/fr\/best-ai-coding-assistants\/\">best AI coding assistants<\/a> puts this in context.<\/p>\n<div class=\"convly-procons\">\n<div class=\"pros\">\n<h4>Strengths<\/h4>\n<ul>\n<li>Open weights under a permissive Modified MIT license<\/li>\n<li>Very low per-token cost with cheap cache hits<\/li>\n<li>Native image and video input, rare for a coding model<\/li>\n<li>256K context suits whole-repo agentic work<\/li>\n<li>~30% reasoning-token reduction trims agent bills<\/li>\n<\/ul>\n<\/div>\n<div class=\"cons\">\n<h4>Limitations<\/h4>\n<ul>\n<li>Trails GPT-5.5 on multiple coding benchmarks<\/li>\n<li>Mandatory thinking mode adds latency and rules out fast non-reasoning calls<\/li>\n<li>Local hosting needs data-center-class GPUs<\/li>\n<li>Headline gains are vendor-reported on internal suites<\/li>\n<li>Not recommended for general chat \u2014 narrow by design<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h2><span class=\"ez-toc-section\" id=\"How_to_use_it_API_vs_running_the_weights\"><\/span>How to use it: API vs running the weights<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The easy path is the API. K2.7 Code is available through Moonshot&#8217;s Kimi API and its Kimi Code CLI, and it speaks the standard tool-calling conventions, so it drops into most existing agent setups. If you build on agent scaffolding, see our guide to the <a href=\"\/fr\/best-ai-agent-frameworks-2026\/\">best AI agent frameworks<\/a> for where a model like this slots in.<\/p>\n<p>Running the open weights is a different story, and this is the part to be clear-eyed about. Like Kimi K2 Thinking before it, K2.7 Code ships pre-quantized in native int4 \u2014 the MoE weights are stored at 4-bit via quantization-aware training while attention stays in BF16 \u2014 which is why the Hugging Face release lands at roughly 595 GB on disk rather than the ~2 TB a full BF16 copy of a 1T-parameter model would need. 
(A full-precision BF16 build is not what Moonshot distributes.) Serving is supported through vLLM, SGLang, and KTransformers.<\/p>\n<table class=\"convly-vs\">\n<thead>\n<tr>\n<th>Setup<\/th>\n<th>Reality<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>~8x 80GB-class GPUs (\u2248640 GB VRAM), native int4<\/td>\n<td>Recommended full-context production setup (\u22485x H200 is a rough equivalent)<\/td>\n<\/tr>\n<tr>\n<td>4x RTX 4090 (96 GB), with CPU\/RAM offload<\/td>\n<td>Possible, but context capped ~64K\u2013128K and much lower throughput<\/td>\n<\/tr>\n<tr>\n<td>Single consumer GPU<\/td>\n<td>Not viable for the full model<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>In short, &#8220;open weights&#8221; does not mean &#8220;runs on your laptop.&#8221; Even at native 4-bit the weights alone exceed half a terabyte, so for most teams the API is the sensible route, and self-hosting is for organizations with serious GPU budgets or strict data-residency needs. If local is a hard requirement, weigh smaller options in our <a href=\"\/fr\/best-local-llm-for-coding-2026\/\">best local LLM for coding<\/a> guide, which covers models that fit real hardware.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"How_it_compares_to_K26_and_rivals\"><\/span>How it compares to K2.6 and rivals<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Against K2.6, K2.7 Code is the better tool for sustained, multi-step coding agents and the worse tool for everything else \u2014 Moonshot&#8217;s own guidance is to keep K2.6 for general tasks. The split is deliberate: one model optimized for agentic coding, the other for breadth.<\/p>\n<p>Against the broader open field, the obvious 2026 rival is Zhipu&#8217;s GLM-5.2, another large open model chasing the same coding-agent niche; we break that one down in our <a href=\"\/fr\/glm-5-2-explained-2026\/\">GLM-5.2 explainer<\/a>, and pit the two against each other in <a href=\"\/fr\/glm-5-2-vs-kimi-k2-7-for-coding-2026\/\">GLM-5.2 vs Kimi K2.7 for coding<\/a>. 
A fair head-to-head is still hard to call: Zhipu shipped GLM-5.2 without published benchmark numbers, and neutral third parties have not yet posted directly comparable agentic-coding scores for the two, so any &#8220;winner&#8221; claim today is premature. Against the closed frontier, K2.7 Code is a value play, not a capability leader: you accept a measurable gap to GPT-5.5 in exchange for open weights and a price that can be an order of magnitude lower.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"FAQ\"><\/span>FAQ<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3>Is Kimi K2.7 Code a chatbot or a coding model?<\/h3>\n<p>It is a coding-specialized model built for agentic software tasks \u2014 planning, editing files, running tools, and debugging across many steps. It is not positioned as a general chatbot. Moonshot recommends the older K2.6 for general conversation and reserves K2.7 Code for coding work.<\/p>\n<h3>How much does Kimi K2.7 Code cost?<\/h3>\n<p>The API lists $0.95 per million input tokens and $4.00 per million output tokens, with cache hits around $0.19 per million input. That is roughly 6x cheaper than Claude Opus 4.8 on output and over 12x cheaper than Claude Fable 5.<\/p>\n<h3>Can I run Kimi K2.7 Code locally?<\/h3>\n<p>Yes, the weights are public under a Modified MIT license, but it is a 1T-parameter model that takes about 595 GB on disk even in its native int4 format. A realistic production setup needs roughly 8 80GB-class GPUs (~640 GB VRAM) \u2014 about five H200s is a rough equivalent. A 4x RTX 4090 rig can run it only with CPU\/RAM offload, reduced context, and lower throughput, and no single consumer GPU will hold the full model.<\/p>\n<h3>How much better is K2.7 Code than K2.6?<\/h3>\n<p>Moonshot reports +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, +31.5% on MLS Bench Lite, and +11.4% on MCPMark Verified, plus about 30% fewer reasoning tokens per task. 
These are vendor-reported figures on Moonshot&#8217;s own benchmarks, so treat them as directional.<\/p>\n<h3>Does Kimi K2.7 Code support images?<\/h3>\n<p>Yes. It includes a 400M-parameter MoonViT vision encoder and accepts text, image, and video input. That lets it work from screenshots, diagrams, or short recordings \u2014 unusual for a coding-focused open model.<\/p>\n<h3>Is Kimi K2.7 Code better than GPT-5.5 for coding?<\/h3>\n<p>Not on the benchmarks cited here. GPT-5.5 leads on Program Bench (69.1 vs 53.6) and MCPMark Verified (92.9 vs 81.1). K2.7 Code&#8217;s advantage is cost: the price gap means you can run it far more often for the same budget, which can win on high-volume agentic workloads.<\/p>\n<h3>What is &#8220;thinking mode&#8221; and can I turn it off?<\/h3>\n<p>Thinking mode is the model&#8217;s internal reasoning step before it answers. In K2.7 Code it is mandatory \u2014 there is no non-thinking mode, and the API returns an error if you try to disable it. The efficiency claim is that it now reaches answers using ~30% fewer reasoning tokens than K2.6.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Bottom_line\"><\/span>Bottom line<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Kimi K2.7 Code is a sharp, deliberately narrow release: an open-weight 1T coding agent that trades a real capability gap to GPT-5.5 for pricing that is hard to argue with and a license that lets you own the model outright. It will not top the leaderboards, and the mandatory thinking mode plus data-center hardware requirement \u2014 over half a terabyte of weights even at native 4-bit \u2014 mean it is not for everyone. But for teams running high-volume agentic coding, where cost per task compounds fast, it is one of the most credible value plays of 2026. 
Use the API unless you have the GPUs and a reason to self-host, benchmark it on your own repos before committing, and keep K2.6 around for the chat it was never meant to do.<\/p>\n<p><!--related-block--><\/p>\n<div class=\"convly-related\">\n<h2><span class=\"ez-toc-section\" id=\"Related_articles\"><\/span>Related articles<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<ul>\n<li><a href=\"https:\/\/convly.ai\/fr\/gpt-5-6-what-we-know-2026\/\">GPT-5.6: What We Know vs What Leaked (2026)<\/a><\/li>\n<li><a href=\"https:\/\/convly.ai\/fr\/glm-5-2-explained-2026\/\">GLM 5.2 Explained: Zhipu&#8217;s Open 1M-Context Coder<\/a><\/li>\n<li><a href=\"https:\/\/convly.ai\/fr\/ollama-vs-jan-2026\/\">Ollama vs Jan: Which Local AI App Wins in 2026?<\/a><\/li>\n<li><a href=\"https:\/\/convly.ai\/fr\/lm-studio-complete-guide-2026\/\">LM Studio: The Complete Guide (2026)<\/a><\/li>\n<\/ul>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>Moonshot&#8217;s Kimi K2.7 Code is a 1T-parameter open-weight model built only for agentic coding. 
We verify the specs, the vendor benchmarks, the int4 hardware reality, and whether its aggressive $0.95\/$4.00 pricing actually beats the closed frontier.<\/p>","protected":false},"author":1,"featured_media":1184,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[3],"tags":[769,767,781,750,782,619,766],"class_list":["post-1179","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-llms","tag-agentic-coding","tag-coding-llm","tag-kimi-k2-7-code","tag-llms","tag-moe","tag-moonshot-ai","tag-open-weights"],"_links":{"self":[{"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/posts\/1179","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/comments?post=1179"}],"version-history":[{"count":1,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/posts\/1179\/revisions"}],"predecessor-version":[{"id":1185,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/posts\/1179\/revisions\/1185"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/media\/1184"}],"wp:attachment":[{"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/media?parent=1179"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/categories?post=1179"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/convly.ai\/fr\/wp-json\/wp\/v2\/tags?post=1179"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}