GLM 4.5V

Z.ai · z-ai/glm-4.5v

← Back to leaderboard

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

open weightsimagetexttext+image->text

Context

Max context: 65536
Max output: 16384

Pricing

Input / 1M: 0.60
Output / 1M: 1.80
Blend / 1M: 1.20

Quality

Quality index:

Provider

Provider: Z.ai
Moderated: no