> Finally, GLM-Z1-9B-0414 is a surprise. We applied all the aforementioned techniques to train a small (9B) model. GLM-Z1-9B-0414 exhibits excellent capabilities in mathematical reasoning and general tasks, and its overall performance ranks among the best of all open-source models of the same size. In resource-constrained scenarios especially, this model strikes an excellent balance between efficiency and effectiveness, offering a powerful option for users seeking lightweight deployment.
Notes: Assuming it was trained on the same 15T-token dataset as the 32B model: 6 FLOP/parameter/token × 9 × 10^9 parameters × 15 × 10^12 tokens = 8.1 × 10^23 FLOP (see the sketch after these notes). "Likely" confidence due to the uncertain dataset size.
Notes: 9B parameters.
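A minimal sketch of the training-compute estimate above, using the standard 6ND approximation (roughly 6 FLOP per parameter per token); the 15T-token dataset size is an assumption carried over from the 32B model, as the note states.

```python
# Training-compute estimate via the standard 6ND approximation:
# total FLOP ≈ 6 * N (parameters) * D (training tokens).

FLOP_PER_PARAM_PER_TOKEN = 6  # forward + backward pass heuristic
N_PARAMS = 9e9                # GLM-Z1-9B-0414: 9B parameters
D_TOKENS = 15e12              # assumption: same 15T-token dataset as the 32B model

total_flop = FLOP_PER_PARAM_PER_TOKEN * N_PARAMS * D_TOKENS
print(f"Estimated training compute: {total_flop:.1e} FLOP")  # -> 8.1e+23 FLOP
```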