Model page

GLM 4.7 Fast

Z.ai's GLM-4.7 routed through Cerebras chat completions for lower-latency coding and agentic work. 131k context. Does not support web search or image input. Cerebras currently lists it as a preview model.

Key dates

We use concrete dates and keep them consistent across the site.

Capability

Intelligence

High

Capability

Speed

Fast

Capability

Context window

131,000 tokens

Capability

Max output

40,000 tokens

Modalities

Input and output

Input: Text
Output: Text

Features

Availability notes

Function calling supported · 2 premium requests per send · Cerebras preview model

ChatGPT

GPT‑4o retirement (ChatGPT)

OpenAI is retiring GPT‑4o inside ChatGPT on February 13, 2026. This does not automatically imply API removal.