OpenClaw Arena benchmarks tell a story worth watching: GLM-5.1 matches Opus 4.6's agentic performance at roughly one-third the cost. OpenClaw runs real agents on real tasks: web browsing, form filling, and file operations. These aren't synthetic benchmarks; the numbers reflect actual production behavior.

GLM-5.1 comes from Zhipu AI, the Chinese company behind the GLM model family that originated at Tsinghua University. The architecture blends autoregressive and autoencoding objectives through what they call "autoregressive blank infilling," which may account for some of the efficiency. Exactly how they achieved this cost-performance ratio hasn't been made public.

If you're running AI agents and cost matters, GLM-5.1 deserves a look. For agentic workloads, the gap with expensive Western models has grown thin, and the pricing advantage is real. One more note from the Hacker News thread: keep an eye on Mythos too, though details there are sparse.
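For readers unfamiliar with the term: blank infilling, as described in the original GLM papers, roughly means sampling spans from the input, replacing each with a mask token, and training the model to regenerate the masked spans autoregressively, conditioned on the corrupted text. A minimal sketch of how such a training example could be constructed (the token strings and `blank_infill_example` helper are illustrative assumptions, not Zhipu's actual preprocessing):

```python
MASK, SOS, EOS = "[MASK]", "[S]", "[E]"

def blank_infill_example(tokens, spans):
    """Build a GLM-style blank-infilling training example.

    tokens: list of input tokens
    spans:  sorted, non-overlapping (start, end) index pairs to mask

    Returns (corrupted, full): the corrupted sequence, and the corrupted
    sequence with the masked spans appended, each wrapped in start/end
    tokens so the model can generate them autoregressively.
    """
    corrupted, targets = [], []
    prev = 0
    for start, end in spans:
        corrupted.extend(tokens[prev:start])  # keep unmasked context
        corrupted.append(MASK)                # one mask per span
        targets.extend([SOS] + tokens[start:end] + [EOS])
        prev = end
    corrupted.extend(tokens[prev:])
    return corrupted, corrupted + targets

toks = "the quick brown fox jumps over the lazy dog".split()
inp, full = blank_infill_example(toks, [(1, 3), (6, 8)])
print(inp)   # ['the', '[MASK]', 'fox', 'jumps', 'over', '[MASK]', 'dog']
print(full)  # inp + ['[S]', 'quick', 'brown', '[E]', '[S]', 'the', 'lazy', '[E]']
```

In the published GLM formulation, the corrupted part attends bidirectionally (like an autoencoder) while the appended spans are generated causally (like an autoregressive LM), which is the mixing the paragraph above refers to. Whether GLM-5.1 still uses this exact objective is not public.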
GLM-5.1 hits Opus 4.6 agent performance at a third the cost
OpenClaw Arena benchmarks show GLM-5.1 matching Opus 4.6 on real agent tasks like web browsing and file operations, but at roughly one-third the cost. Zhipu AI's model narrows the gap with Western competitors for production agent workloads.