Loading...

Model Evolution

One challenge, every Zhipu AI generation.

Sign in

Zhipu AI

Builds the GLM family of bilingual (Chinese and English) language models.

Total Models

7

Text Models

7

Active Period

Jul 2025 to Jun 2026

Spun out of Tsinghua University's KEG Lab.

GLM family uses a unique autoregressive blank-infilling architecture.

Strong bilingual (Chinese/English) and code generation capabilities.

Rapid iteration from GLM-4 through GLM-5 with competitive benchmarks.

Compare Zhipu AI Models

Z.ai: GLM 5.2

Jun 2026

GLM-5.2 is Z.ai's flagship model built for the era of long-horizon tasks. With a genuinely usable 1M-token context window, it holds project-level engineering context, executes long-running tasks more reliably, follows engineering standards more consistently, and can carry a project from requirements all the way to multi-platform deployment in a single task.

conversationreasoningcode-generationanalysisagentic-tool-usetool-useplanning

Z.ai: GLM 5

Feb 2026

GLM-5 is Z.ai's flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

conversationreasoningcode-generationanalysisagentic-tool-usetool-use

GLM 4.7 Flash

Jan 2026

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.6

Sep 2025

GLM 4.6 expands the GLM family with a 200K-token context window, stronger coding benchmarks, and more reliable multi-step reasoning. It integrates deeply with agent frameworks to orchestrate tool use and produces more natural writing for long-form chat.

conversationreasoningcode-generationanalysistool-use

Z.AI: GLM 4.5

Jul 2025

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.5 Air

Jul 2025

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean.

conversationreasoninganalysis

Z.AI: GLM 4 32B

Jul 2025

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It is made by the same lab behind the thudm models.

conversationreasoningcode-generationanalysistool-use

Loading...

Model Evolution

One challenge, every Zhipu AI generation.

Zhipu AI

Builds the GLM family of bilingual (Chinese and English) language models.

Total Models

7

Text Models

7

Active Period

Jul 2025 to Jun 2026

Spun out of Tsinghua University's KEG Lab.

GLM family uses a unique autoregressive blank-infilling architecture.

Strong bilingual (Chinese/English) and code generation capabilities.

Rapid iteration from GLM-4 through GLM-5 with competitive benchmarks.

Compare Zhipu AI Models

Z.ai: GLM 5.2

Jun 2026

GLM-5.2 is Z.ai's flagship model built for the era of long-horizon tasks. With a genuinely usable 1M-token context window, it holds project-level engineering context, executes long-running tasks more reliably, follows engineering standards more consistently, and can carry a project from requirements all the way to multi-platform deployment in a single task.

conversationreasoningcode-generationanalysisagentic-tool-usetool-useplanning

Z.ai: GLM 5

Feb 2026

GLM-5 is Z.ai's flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

conversationreasoningcode-generationanalysisagentic-tool-usetool-use

GLM 4.7 Flash

Jan 2026

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.6

Sep 2025

GLM 4.6 expands the GLM family with a 200K-token context window, stronger coding benchmarks, and more reliable multi-step reasoning. It integrates deeply with agent frameworks to orchestrate tool use and produces more natural writing for long-form chat.

conversationreasoningcode-generationanalysistool-use

Z.AI: GLM 4.5

Jul 2025

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.5 Air

Jul 2025

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean.

conversationreasoninganalysis

Z.AI: GLM 4 32B

Jul 2025

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It is made by the same lab behind the thudm models.

conversationreasoningcode-generationanalysistool-use

Loading...

Model Evolution

One challenge, every Zhipu AI generation.

Zhipu AI

Builds the GLM family of bilingual (Chinese and English) language models.

Total Models

7

Text Models

7

Active Period

Jul 2025 to Jun 2026

Spun out of Tsinghua University's KEG Lab.

GLM family uses a unique autoregressive blank-infilling architecture.

Strong bilingual (Chinese/English) and code generation capabilities.

Rapid iteration from GLM-4 through GLM-5 with competitive benchmarks.

Compare Zhipu AI Models

Z.ai: GLM 5.2

Jun 2026

GLM-5.2 is Z.ai's flagship model built for the era of long-horizon tasks. With a genuinely usable 1M-token context window, it holds project-level engineering context, executes long-running tasks more reliably, follows engineering standards more consistently, and can carry a project from requirements all the way to multi-platform deployment in a single task.

conversationreasoningcode-generationanalysisagentic-tool-usetool-useplanning

Z.ai: GLM 5

Feb 2026

GLM-5 is Z.ai's flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

conversationreasoningcode-generationanalysisagentic-tool-usetool-use

GLM 4.7 Flash

Jan 2026

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.6

Sep 2025

GLM 4.6 expands the GLM family with a 200K-token context window, stronger coding benchmarks, and more reliable multi-step reasoning. It integrates deeply with agent frameworks to orchestrate tool use and produces more natural writing for long-form chat.

conversationreasoningcode-generationanalysistool-use

Z.AI: GLM 4.5

Jul 2025

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.5 Air

Jul 2025

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean.

conversationreasoninganalysis

Z.AI: GLM 4 32B

Jul 2025

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It is made by the same lab behind the thudm models.

conversationreasoningcode-generationanalysistool-use

Zhipu AI

Builds the GLM family of bilingual (Chinese and English) language models.

Total Models

7

Text Models

7

Active Period

Jul 2025 to Jun 2026

Spun out of Tsinghua University's KEG Lab.

GLM family uses a unique autoregressive blank-infilling architecture.

Strong bilingual (Chinese/English) and code generation capabilities.

Rapid iteration from GLM-4 through GLM-5 with competitive benchmarks.

Compare Zhipu AI Models

Z.ai: GLM 5.2

Jun 2026

GLM-5.2 is Z.ai's flagship model built for the era of long-horizon tasks. With a genuinely usable 1M-token context window, it holds project-level engineering context, executes long-running tasks more reliably, follows engineering standards more consistently, and can carry a project from requirements all the way to multi-platform deployment in a single task.

conversationreasoningcode-generationanalysisagentic-tool-usetool-useplanning

Z.ai: GLM 5

Feb 2026

GLM-5 is Z.ai's flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

conversationreasoningcode-generationanalysisagentic-tool-usetool-use

GLM 4.7 Flash

Jan 2026

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.6

Sep 2025

GLM 4.6 expands the GLM family with a 200K-token context window, stronger coding benchmarks, and more reliable multi-step reasoning. It integrates deeply with agent frameworks to orchestrate tool use and produces more natural writing for long-form chat.

conversationreasoningcode-generationanalysistool-use

Z.AI: GLM 4.5

Jul 2025

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

conversationreasoningcode-generationanalysis

Z.AI: GLM 4.5 Air

Jul 2025

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean.

conversationreasoninganalysis

Z.AI: GLM 4 32B

Jul 2025

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It is made by the same lab behind the thudm models.

conversationreasoningcode-generationanalysistool-use