使用Mindspore模型训练时出现梯度为0现象

huan666 · 2025 年8 月 13 日 20:47

1 系统环境

硬件环境(Ascend/GPU/CPU): Ascend/GPU/CPU
MindSpore版本: mindspore=2.2.0
执行模式（PyNative/ Graph）:不限
Python版本: Python=3.7
操作系统平台: 不限

2 报错信息

2.1问题描述

使用Mindspore模型训练时出现梯度为0现象

2.2 脚本代码

def forward_fn(inputs, targets):
    logits = model(inputs)
    loss = criterion(logits, targets)
    return loss, logits

# get grad function
grad_fn = ms.value_and_grad(forward_fn, None, optimizer.parameters, has_aux=True)

# define train step function
def train_step(inputs, targets):
    (loss, logits), grads = grad_fn(inputs, targets) # get values and gradients
    print('grads=',grads)
    optimizer(grads) # update gradient
    return loss, logits

3 根因分析

分析代码，梯度是在value_and_grad中获取的，看看官网的介绍，第三个位置的weight需要net.trainable_params() 获取，代码里的net是model

4 解决方案

修改

optimizer.parameters为 model.trainable_params() 
# get grad function 
grad_fn = ms.value_and_grad(forward_fn, None, model.trainable_params(), has_aux=True)

话题		回复	浏览量
MindSpore梯度计算报错AttributeError: module ‘mindspore’ has no attribute ‘value_and_grad’分析及解决功能调试-Function Debugging	0	12	2026 年3 月 1 日
函数变换获得梯度计算函数时报错AttributeError: module 'mindspore' has no attribute 'value_and_grad' 模型训练-Model Training	0	28	2025 年8 月 5 日
MindSpore动态图模式下梯度计算报错AttributeError: module 'mindspore' has no attribute 'value_and_grad' 模型训练-Model Training	1	25	2026 年3 月 1 日
MindSpore不能像torch的param.grad直接获取梯度问题及解决模型训练-Model Training	0	33	2025 年8 月 9 日
解决MindSpore神经网络训练中的梯度消失问题模型训练-Model Training	0	40	2025 年9 月 2 日

使用Mindspore模型训练时出现梯度为0现象

1 系统环境

2 报错信息

2.1问题描述

3 根因分析

4 解决方案

相关话题