Of all the Chinese AI labs, Zhipu AI — now operating internationally as Z.ai — may be the most strategically significant. Its GLM-5.1 model topped a global coding leaderboard, ships under the permissive MIT license, costs a fraction of Western alternatives, and was trained entirely on Huawei chips, without a single Nvidia GPU. That last fact makes GLM a statement about the future of AI independence as much as a product. Here’s the full picture.
Principaux enseignements
- GLM-5.1 (March 2026, open weights April 7) is a 744B-parameter MoE under the MIT license.
- Topped SWE-Bench Pro at 58.4, nudging past GPT-5.4 (57.7) and Claude Opus 4.6 (57.3) — though some scores are self-reported.
- Radically cheap: ~$0.98/$3.08 per million tokens; a coding plan from $3-$30/mo vs $100-$200 for Claude Max.
- Trained entirely on Huawei Ascend chips — no Nvidia hardware, a major proof point for AI without US silicon.
- Meilleur pour : cost-conscious teams wanting a near-Opus open model they can self-host.
Who is Zhipu / Z.ai
Zhipu AI is a Beijing lab spun out of Tsinghua University, one of China’s most prestigious institutions. It’s among the original “AI tiger” startups and has positioned itself as the enterprise- and developer-focused alternative to consumer plays like Doubao. In 2026 it rebranded internationally as Z.ai, signaling a push for global developers.
Its model family is GLM (General Language Model). Where DeepSeek competes on price and Kimi on agentic coding, Zhipu’s pitch is “90%+ of Claude’s quality at a tenth of the cost, fully open, and built on a sovereign hardware stack.” That last part is the differentiator nobody else can claim as cleanly.
What GLM-5.1 actually is
GLM-5.1, released March 27, 2026 with open weights following on April 7, is a post-training upgrade to the GLM-5 base — the same ~744-billion-parameter Mixture-of-Experts architecture, with significantly enhanced coding, tool use, and autonomous execution. It supports a 200K context window, 128K max output, thinking mode, function calling, structured output, context caching, and native MCP integration.
Critically, it’s released under the MIT license on Hugging Face — download, modify, fine-tune, and deploy commercially with no restrictions or royalties. Combined with strong capability, that makes GLM-5.1 one of the most genuinely useful open models available.
The benchmark — with an honest asterisk
On SWE-Bench Pro, GLM-5.1 reportedly scored 58.4, topping the global leaderboard and edging past GPT-5.4 (57.7) and Claude Opus 4.6 (57.3). Zhipu also claims a coding score of 45.3 — about 94.6% of Claude Opus 4.6’s performance.
Here’s the honest caveat, and it matters: some of these headline numbers are self-reported by Z.ai, and as of early 2026 independent labs had not fully corroborated the most flattering coding figures. The model is clearly excellent — multiple third-party reviews confirm it’s near-Opus on real work — but treat the exact “94.6% of Opus” claim as a vendor figure, not gospel. The practical takeaway holds: GLM-5.1 delivers most of Claude’s quality for routine work at a tiny fraction of the price.
The Huawei angle — why GLM matters beyond benchmarks
The single most significant fact about GLM isn’t a benchmark — it’s the hardware. Zhipu trained the entire GLM-5 family exclusively on Huawei Ascend 910B accelerators, with no Nvidia GPUs involved.
This is a landmark. US export controls have tried to choke China’s access to Nvidia’s best chips precisely to slow Chinese frontier AI. GLM-5.1 is living proof that a competitive, leaderboard-topping frontier model can be built entirely on domestic Chinese silicon. Whatever you think of the geopolitics, it reshapes the strategic picture: the hardware chokepoint is leakier than assumed.
Where GLM wins
1. Claude quality at 1/30th the cost
The GLM Coding Plan starts at $3-$30/month versus $100-$200/month for Claude Max. If GLM delivers 90%+ of Opus quality for your routine coding — and for many teams it does — the savings are transformative.
2. MIT-licensed open weights
Like DeepSeek, GLM ships its best model fully open. Self-host, fine-tune, air-gap — total control, no royalties.
3. Genuinely strong agentic coding
GLM-5.1’s enhancements target tool use and autonomous execution, with native MCP support. It’s built for the agent era, not just chat.
4. Hardware sovereignty
For organizations (especially in China and allied markets) that want to avoid dependence on US silicon and software, GLM is the clearest path — and that strategic appeal is real regardless of benchmarks.
Where GLM loses — the honest caveats
1. Some benchmarks are self-reported
The most flattering numbers come from Z.ai itself, without full independent corroboration. The model is excellent, but discount the exact vendor claims and test on your own workload.
2. Hosted-API caveats
The Z.ai API carries the usual China data-residency and content-moderation considerations. The MIT weights let you self-host to avoid both.
3. Smaller context than rivals
200K context is solid but trails DeepSeek and Qwen’s 1M windows. For very long-document or whole-codebase work, that’s a real limitation.
4. Ecosystem still maturing
Z.ai’s international developer experience is newer than Alibaba’s or the US labs’. Improving quickly, but not yet at parity.
GLM vs the field
| Dimension | GLM-5.1 | DeepSeek V4 | Kimi K2.6 | Claude Opus 4.8 |
|---|---|---|---|---|
| Open weights | Yes (MIT) | Yes (MIT) | Oui | Non |
| Coding (SWE-Bench Pro) | 58.4 | ~58 | 58.6 | Frontier |
| Prix | ~$0.98/$3.08 | ~$0.44/$0.87 | ~$0.60/$2.50 | ~$5/$25 |
| Context window | 200K | 1M | 262K | 1M |
| Hardware story | Huawei-only | Nvidia | Nvidia | Nvidia/TPU |
Pros and cons
GLM pros
- Near-Claude quality at a fraction of the price
- MIT-licensed open weights — self-hostable
- Strong agentic coding with native MCP support
- Trained entirely on Huawei chips (sovereign stack)
- Coding plan from $3-$30/month
GLM cons
- Headline benchmarks partly self-reported
- 200K context trails 1M rivals
- Hosted API has China data/moderation caveats
- International ecosystem still maturing
How to access GLM
- Hosted API / coding plan: z.ai (formerly open.bigmodel.cn) — cheapest direct option, including the $3-$30/mo GLM Coding Plan.
- Western hosts: OpenRouter and others serve GLM-5.1 with non-China data residency.
- Self-host: download GLM-5.1 weights from Hugging Face (MIT) and run on your own hardware.
FAQ
Is GLM-5.1 really as good as Claude?
For routine coding and general work, multiple reviews put it at roughly 90-95% of Claude Opus 4.6’s quality — at a tiny fraction of the cost. For the most demanding frontier reasoning, Claude Opus 4.8 (newer) still leads. The honest framing: GLM gives you most of Claude’s value for a fraction of the price, with the caveat that the most flattering benchmark numbers are vendor-reported.
What does the “no Nvidia” training mean for me?
Practically, nothing about using the model. Strategically, it proves competitive frontier models can be built on non-US hardware — which matters for anyone thinking about long-term AI supply-chain risk and the effectiveness of chip export controls.
Is GLM open source?
Yes — GLM-5.1 weights are on Hugging Face under the MIT license, one of the most permissive available. You can use it commercially with no restrictions.
Who is Z.ai?
Z.ai is the international brand of Zhipu AI, a Beijing lab spun out of Tsinghua University. The rebrand in 2026 reflects a push to serve global developers.
How does the GLM Coding Plan compare to Claude Max?
GLM’s coding plan runs $3-$30/month; Claude Max is $100-$200/month. If GLM covers your routine work at acceptable quality — and for many developers it does — that’s a 5-30x cost reduction. Many teams now use GLM for bulk coding and reserve Claude for the hardest tasks.
Is GLM-5.1 free to use?
Yes — the GLM-5.1 weights are released under the permissive MIT license, so you can download, self-host, fine-tune, and use them commercially for free (you pay only for compute). Z.ai’s hosted API is paid but very cheap, including a GLM Coding Plan from $3-$30/month that undercuts Claude Max many times over.
Is GLM-5.1 better than DeepSeek V4?
They’re close, with different strengths. GLM-5.1 is tuned for agentic coding and tops some coding leaderboards, and its all-Huawei training is a unique strategic angle. DeepSeek V4 is cheaper still, has a larger 1M context window (vs GLM’s 200K), and is a stronger all-rounder. For routine coding on a budget both are excellent; for the longest-context work, DeepSeek edges it.
Résultat
GLM-5.1 is the most strategically interesting model in Chinese AI. It delivers near-Claude coding quality, ships as MIT-licensed open weights, costs a fraction of Western alternatives, and — uniquely — proves that a frontier-competitive model can be trained entirely on Chinese silicon.
The honest caveats keep it grounded: discount the self-reported benchmark peaks, note the 200K context ceiling, and route around the hosted API for sensitive data by self-hosting the open weights. Do that, and GLM-5.1 is one of the best value propositions in AI — and, with its all-Huawei training, the clearest sign yet that the global AI landscape is no longer one the US can control through hardware alone.
