Network migration (tacotron2): MindSpore weight initialization does not match PyTorch

chengxiaoli · September 14, 2025, 16:23

1. System environment

Hardware (Ascend/GPU/CPU): GPU
MindSpore version: mindspore=2.0
Execution mode (PyNative/Graph): any
Python version: 3.7
Operating system: Linux

2. Problem description

Code used to compare the weights of the PyTorch and MindSpore networks:

import troubleshooter as ts

# Compare the parameters saved in the PyTorch .pth file and the MindSpore .ckpt file.
ts.migrator.compare_pth_and_ckpt(
    weight_map_path="/home/hodia/python/Pytorch-fasterRCNN/pt_net_info/torch_net_map.json",
    pt_file_path="/home/hodia/python/Pytorch-fasterRCNN/pt_net_info/torch_troubleshooter_create.pth",
    ms_file_path="/home/hodia/python/MindSpore-fasterRCNN/ms_net_info/mindspore_troubleshooter_create.ckpt",
    compare_value=True)

3. Solution

Convert the PyTorch weights to MindSpore format, load them into the MindSpore network, and then train:

ts.migrator.convert_weight_and_load(weight_map_path="/home/hodia/python/Pytorch-fasterRCNN/pt_net_info/torch_net_map.json", 
                                    pt_file_path="/home/hodia/python/Pytorch-fasterRCNN/pt_net_info/torch_troubleshooter_create.pth", 
                                    net=model)
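
For context, a weight map pairs each PyTorch parameter name with its MindSpore counterpart. The sketch below is not TroubleShooter's implementation: `build_name_map` and the sample layer names are hypothetical, and the only renaming it assumes is the usual BatchNorm one (weight/bias/running_mean/running_var become gamma/beta/moving_mean/moving_variance). Every other name is kept unchanged, which holds for Conv2d and Dense weights and biases but not necessarily for other layer types.

```python
# Hypothetical helper: illustrates the idea of a PyTorch -> MindSpore name map.
# Not part of TroubleShooter; the real map is read from torch_net_map.json.

# BatchNorm parameters are named differently in the two frameworks.
BN_SUFFIX_MAP = {
    "weight": "gamma",
    "bias": "beta",
    "running_mean": "moving_mean",
    "running_var": "moving_variance",
}


def build_name_map(torch_names, bn_prefixes):
    """Map each PyTorch parameter name to an assumed MindSpore name.

    torch_names: iterable of names such as "conv1.weight".
    bn_prefixes: set of layer prefixes known to be BatchNorm layers.
    """
    name_map = {}
    for name in torch_names:
        prefix, _, suffix = name.rpartition(".")
        if suffix == "num_batches_tracked":
            # Bookkeeping buffer with no MindSpore counterpart; skip it.
            continue
        if prefix in bn_prefixes and suffix in BN_SUFFIX_MAP:
            name_map[name] = f"{prefix}.{BN_SUFFIX_MAP[suffix]}"
        else:
            # Assumption of this sketch: all other names stay the same.
            name_map[name] = name
    return name_map


if __name__ == "__main__":
    names = ["conv1.weight", "conv1.bias", "bn1.weight", "bn1.running_var",
             "bn1.num_batches_tracked"]
    for src, dst in build_name_map(names, {"bn1"}).items():
        print(f"{src} -> {dst}")
```

If a name is missing from the map or mapped to the wrong target, the converted checkpoint will not load cleanly, so it is worth inspecting the map before comparing values.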

The inconsistency shows up mainly in weight initialization.
PyTorch initialization:

for layer in self.children():
    if isinstance(layer, nn.Conv2d):
        torch.nn.init.normal_(layer.weight, std=0.01)
        torch.nn.init.constant_(layer.bias, 0)

MindSpore initialization:

for _, layer in self.cells_and_names():
    if isinstance(layer, nn.Conv2d):
        init = ms.common.initializer  # alias to keep the lines short
        layer.weight.set_data(init.initializer(init.Normal(sigma=0.01, mean=0.0), layer.weight.shape, layer.weight.dtype))
        # ms.nn.Conv2d defaults to has_bias=False, in which case layer.bias is None.
        if layer.bias is not None:
            layer.bias.set_data(init.initializer("zeros", layer.bias.shape, layer.bias.dtype))
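
Even when the distributions match, the two frameworks draw from independent random number generators, so freshly initialized weights agree only statistically and not element by element. The sketch below illustrates this using only the Python standard library. The two `random.Random` instances merely stand in for the two frameworks' generators, and `init_normal` is a hypothetical helper, not an API of either framework.

```python
import random
import statistics


def init_normal(rng, n, std=0.01, mean=0.0):
    """Draw n samples from N(mean, std^2) using the given generator."""
    return [rng.gauss(mean, std) for _ in range(n)]


# Two independent generators stand in for PyTorch's and MindSpore's RNGs.
torch_like = init_normal(random.Random(1), 10000)
ms_like = init_normal(random.Random(2), 10000)

# The distributions match: both standard deviations are close to 0.01 ...
print(statistics.pstdev(torch_like), statistics.pstdev(ms_like))

# ... but the individual values do not, so an element-wise comparison fails.
max_diff = max(abs(a - b) for a, b in zip(torch_like, ms_like))
print("max element-wise difference:", max_diff)
```

This is why a value-level match requires converting the PyTorch weights rather than re-initializing the MindSpore network with the same distribution.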
