Model page
GLM 4.7 Fast
Z.ai's GLM-4.7 routed through Cerebras chat completions for lower-latency coding and agentic work. 131k context. Does not support web search or image input. Cerebras currently lists it as a preview model.
Key dates
We use concrete dates and keep them consistent across the site.
Capability
Intelligence
High
Capability
Speed
Fast
Capability
Context window
131,000 tokens
Capability
Max output
40,000 tokens
Modalities
Input and output
Input: Text
Output: Text
Features
Availability notes
Function calling supported · 2 premium requests per send · Cerebras preview model
ChatGPT
GPT‑4o retirement (ChatGPT)
OpenAI is retiring GPT‑4o inside ChatGPT on February 13, 2026. This does not automatically imply API removal.