Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
Modalities
In / Out Price
$0.1365 / $0.4095per 1M
Context
131K
Weekly Rank
#269on OpenRouter
Knowledge Cutoff
Mar 31, 2025
Going away May 13, 2026
