QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD

介绍

今年一月底,我们推出了 Qwen2.5-VL 系列模型,获得了社区的广泛关注和积极反馈。在 Qwen2.5-VL 系列的基础上,我们使用强化学习持续优化模型,并使用 Apache 2.0 协议开源 32B 这个备受喜爱的参数规模的新 VL 模型—— Qwen2.5-VL-32B-Instruct。相比此前发布的 Qwen2.5-VL 系列模型,本次推出的 32B 模型的特点如下:

  • 回复更符合人类主观偏好:调整了输出风格,使回答更加详细、格式更规范,并更符合人类偏好。
  • 数学推理能力:复杂数学问题求解的准确性显著提升。
  • 图像细粒度理解与推理:在图像解析、内容识别以及视觉逻辑推导等任务中表现出更强的准确性和细粒度分析能力。

性能表现

我们与业内先进的同规模模型进行比较,包括近期推出的 Mistral-Small-3.1-24B 和 Gemma-3-27B-IT, Qwen2.5-VL-32B-Instruct 展现出了明显的优势,甚至超越了更大规模的 Qwen2-VL-72B-Instruct 模型。尤其是在多模态任务中,例如 MMMUMMMU-ProMathVista,这些任务强调复杂的多步骤推理,Qwen2.5-VL-32B-Instruct 表现尤为突出。在注重主观用户体验评估的 MM-MT-Bench 基准测试中,该模型相较于其前代 Qwen2-VL-72B-Instruct 取得了显著进步。

除了在视觉能力上优秀,Qwen2.5-VL-32B-Instruct 在纯文本能力上也达到了同规模的最优表现。

样例

Fine-grained Image Understanding and Reasoning Next
User
I am driving a large truck on this road, and it is now 12 o’clock. Can I reach a place 110 kilometers away before 13 o’clock? image
Qwen2.5-VL-32B

To determine whether you can reach a place 110 kilometers away before 13:00, we need to analyze the information provided:

  1. Current Time: It is 12:00.
  2. Distance to Destination: 110 kilometers.
  3. Speed Limit for Trucks: The speed limit sign indicates that the maximum speed for trucks is 100 km/h.

Step-by-Step Analysis:

1. Calculate the Required Travel Time

The formula to calculate travel time is: \[ \text{Time} = \frac{\text{Distance}}{\text{Speed}} \] Given:

  • Distance = 110 km
  • Maximum Speed = 100 km/h

Substitute the values: \[ \text{Time} = \frac{110 \text{ km}}{100 \text{ km/h}} = 1.1 \text{ hours} \]

2. Convert Time to Hours and Minutes

1.1 hours is equivalent to: \[ 1 \text{ hour} + 0.1 \times 60 \text{ minutes} = 1 \text{ hour} + 6 \text{ minutes} = 1 \text{ hour and } 6 \text{ minutes} \]

3. Determine Arrival Time

If you start at 12:00 and it takes 1 hour and 6 minutes to travel 110 kilometers, your arrival time will be: \[ 12:00 + 1 \text{ hour and } 6 \text{ minutes} = 13:06 \]

4. Compare with Target Time

You want to arrive before 13:00. However, based on the calculations, you will arrive at 13:06, which is after 13:00.

Conclusion:

Given the speed limit of 100 km/h for trucks, it is not possible to reach a place 110 kilometers away before 13:00.

Thus, the final answer is: \[ \boxed{\text{No}} \]

下一步计划

尽管 Qwen2.5-VL-32B 在强化学习框架下优化了主观体验和数学推理能力——这主要基于“快速思考”模式,但我们的下一步研究将聚焦于长且有效的推理过程,以突破视觉模型在处理高度复杂、多步骤视觉推理任务中的边界。

引用

如果你觉得我们的模型对你有帮助,欢迎在你的工作中引用我们的技术报告:

@article{Qwen2.5-VL,
  title={Qwen2.5-VL Technical Report},
  author={Bai, Shuai and Chen, Keqin and Liu, Xuejing and Wang, Jialin and Ge, Wenbin and Song, Sibo and Dang, Kai and Wang, Peng and Wang, Shijie and Tang, Jun and Zhong, Humen and Zhu, Yuanzhi and Yang, Mingkun and Li, Zhaohai and Wan, Jianqiang and Wang, Pengfei and Ding, Wei and Fu, Zheren and Xu, Yiheng and Ye, Jiabo and Zhang, Xi and Xie, Tianbao and Cheng, Zesen and Zhang, Hang and Yang, Zhibo and Xu, Haiyang and Lin, Junyang},
  journal={arXiv preprint arXiv:2502.13923},
  year={2025}
}