In recent months, our focus has been on developing a “good” model while optimizing the developer experience. Just before the Chinese New Year, we are introducing Qwen1.5, the next iteration in our Qwen series. With Qwen1.5, we are open-sourcing base and chat models across six sizes: 0.5B, 1.8B, 4B, 7B, 14B, and 72B. In line with tradition, we’re also providing quantized models, including Int4 and Int8 GPTQ models, as well as AWQ and GGUF quantized models. To enhance the developer experience, we’ve merged Qwen1.5’s code into Hugging Face transformers, making it accessible with transformers>=4.37.0 without needing trust_remote_code.
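As a minimal sketch of what this integration looks like in practice, the snippet below loads a chat model with plain transformers and generates a reply. The Hub ID Qwen/Qwen1.5-0.5B-Chat is assumed to follow the Qwen/Qwen1.5-&lt;size&gt;-Chat naming pattern; any of the six sizes can be substituted.

```python
# Minimal sketch: loading a Qwen1.5 chat model with plain transformers
# (>= 4.37.0), no trust_remote_code needed. The model ID below assumes
# the Qwen/Qwen1.5-<size>-Chat naming pattern on the Hugging Face Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-0.5B-Chat"  # assumed Hub ID; swap in any size
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Chat models ship a chat template, so apply_chat_template builds the prompt.
messages = [{"role": "user", "content": "Give me a short introduction to Qwen."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Because the architecture now lives in transformers itself, the same AutoModelForCausalLM call works for every size without custom modeling code.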
Notes: 3T training tokens (https://github.com/QwenLM/Qwen2/issues/97). Using the standard C ≈ 6ND approximation, training compute for the 72B model is 6 × 72 billion parameters × 3 trillion tokens ≈ 1.3e24 FLOP (see the sketch after these notes).
Size Notes: 3 trillion training tokens, per this response: https://github.com/QwenLM/Qwen2/issues/97
Notes: 72B parameters.
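A quick sanity check of the arithmetic above, as a minimal sketch (variable names are illustrative, not from the source):

```python
# Sanity check of the C ≈ 6 * N * D training-compute estimate.
n_params = 72e9   # N: 72B parameters
n_tokens = 3e12   # D: 3T training tokens (QwenLM/Qwen2 issue #97)
compute_flop = 6 * n_params * n_tokens
print(f"{compute_flop:.2e} FLOP")  # -> 1.30e+24 FLOP, i.e. ~1.3e24
```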