Model parallelism reports an out-of-memory error

chengxiaoli (Cheng_li) October 23, 2025, 03:43 #1

1 System environment

Hardware environment (Ascend/GPU/CPU): Ascend
MindSpore version: 2.2.0
Execution mode (PyNative/Graph): Any

