MindSpore YOLO分布式训练报错Invalid argument

1 系统环境

硬件环境(Ascend/GPU/CPU): Ascend
MindSpore版本: mindspore=2.5.0、mindyolo =0.5.0
执行模式(PyNative/ Graph): 不限
Python版本: Python=3.9
操作系统平台: Ubuntu18.04

2 报错信息

2.1 脚本信息

使用如下语法导入TextClassifier报错:

pip install -r mindyolo-master/requirements.txt
安装mindyolo依赖并配置好自定义数据集配置文件和模型配置文件

msrun --worker_num=8 --local_worker_num=8 --bind_core=True --log_dir=./yolov11_log python mindyolo-master/train.py --config mindyolo-master/configs/yolov11/yolov11-m.yaml --device_target Ascend --is_parallel True # yolo11 Distributed Training :https://gitee.com/mindspore-lab/mindyolo/tree/master/configs/yolov11

2.2 报错信息

scheduler.log:
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
return self._float_to_str(self.smallest_subnormal)
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
return self._float_to_str(self.smallest_subnormal)
worker_0.log:
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
return self._float_to_str(self.smallest_subnormal)
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/mindspore/miniconda/envs/jupyter/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
return self._float_to_str(self.smallest_subnormal)
[WARNING] DISTRIBUTED(4166,ffff899f6020,python):2025-07-07-02:56:25.084.274 [mindspore/ccsrc/distributed/rpc/tcp/tcp_comm.cc:485] Connect] Connection 18 source: 127.0.0.1:39840, destination: 127.0.0.1:8118
[WARNING] DISTRIBUTED(4166,ffff899f6020,python):2025-07-07-02:56:25.084.364 [mindspore/ccsrc/distributed/rpc/tcp/tcp_client.cc:76] Connect] Failed to connect to the tcp server : 127.0.0.1:8118, retry to reconnect(1/1)...

3 根因分析

此处由用户填写

4 解决方案

此处由用户填写
包含文字方案和最终脚本代码 请将正确的脚本打包并上传附件