GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs(opens in new tab)
Modalities
In / Out Price
$0.13 / $0.85per 1M
Context
131K
Weekly Rank
#45on OpenRouter
Knowledge Cutoff
Dec 31, 2024