1 系统环境
硬件环境(Ascend/GPU/CPU): Ascend/GPU/CPU
MindSpore版本: mindspore=2.0.0
执行模式(PyNative/ Graph):不限
Python版本: Python=3.7.5
操作系统平台: 不限
2 报错信息
2.1 问题描述
定义了一个transforms的操作集合,通过ImageFolderDataset读取图片数据,transforms_list里面包含转ToPIL以及Resize操作,运行之后,会出现如下报错。显示图片的维度信息不对。
2.2 脚本信息
import mindspore.dataset as ds
import mindspore.dataset.vision as vision
import nupy as np
from mindspore.dataset.transforms import Compose
from PIL import Image
image path = "/data2/shenwei/Minddata/test/mindspore/tests/ut/data/dataset/testPK/data"
transforms_List = Compose([vision.ToPIL(),vision.Resize((200, 200))])
# pipeline mode
dataset = ds.ImageFoderDataset(image_path, shuffle=False)
dataset = dataset.map(operations=transforms_list, input_coumns="image")
for item in dataset.create_dict_iterator():
print(item[″image"].shape)
print(item[″image"].dtype)
break
# eager mode
img = np.fromfile("/data2/shenwei/Minddata/test/mindspore/tests/ut/data/dataset/testPK/data/class1/Θ.jpg",np.uint8)
img = transforms_list(img)[0]
assert isinstance(img, Image.Image) print(type(img))
2.3 问题描述
Traceback (most recentcall ast):
File "tset_numpy.py", line 15, in<module>
for item in dataset.create_dict .iterator():
File"/home/shenwei/.conda/envs/㏕ib/python3.7/s ite-packages/mindspore/dataset/engine/iterators.py",line 152, in _next_
data = sef._get_next( )
File "/home/shenwei/.conda/envs/lib/python3.7/s ite-packages/mindspore/dataset/engine/iterators.py", line 277, in _get_next
raise err
File"/home/shenwei/.conda/envs/lib/python3.7/site-packages/mindspore/dataset/engine/iterators.py", line 260, in _get_next
-Python Call stack:
map operation: [PyFunc] failed. Thecorresponding data file is: /data2/shenwei/Minddata/test/mindspore/tests/ut/data/dataset/testpK/data/class1/2.jpg. Error description: valueÈrror: Traceback (most recentcall last):
File'"/home/shenwei/.conda/envs/shenweb/python3.7/s ite-packages/mindspore/dataset/transforms/py_transforms_util.py", line 63, in copose
args = transform(*args )
File "/home/shenwei/.conda/envs/lib/python3.7/s ite-packages/mindspore/dataset/transforms/transforms.py", Line 133, in _call_
return self._execute py( img)
File"/home/shenwei/.conda/envs/lib/python3.7/s ite-packages/mindspore/dataset/vision/transforms.py",line 4739, in _execute_py
return util.to_pil( img)
File "/home/shenwei/ conda/envs/lib/python3.7/s ite-packages/mindspore/dataset/vis ion/py_transforms_util.py", line 178, in to_pil
raise ValueError("Thedimens ion of input image should be 2or 3. Got {}.".format(img.ndim))
ValueError: The dimens ion ofinput image should be 2 or 3. Got 1
- Dataset Pipeline Error Message:
ERROR Execute user Python code failed. check 'pvthon Call Stack' above
- C++ Call Stack: (For framework developers.
mindspore/ccsrc/minddata/dataset/engine/datasetops/map op/map iob.h(57)复制
3 根因分析
代码有两个错:(还有一些字符输入错误的暂不管)
1.ImageFolderDataset 读取目录下的图像文件后输入的就是一个图像, 所以第一步应该是做Decode
做完Decode后可以调用vision.ToPIL(), 或者直接调用Decode的时候指定参数, vision.Decode(to_pil=True)
2.eager mode下输入的是一个numpy, 是可以直接用vision.ToPIL()的
但是np.fromfile读取图像的rank是1 同样会报错, 所以要用Image.open后转numpy才行
4 解决方案
import mindspore.dataset as ds
import mindspore.dataset.vision as vision
import numpy as np
from mindspore.dataset.transforms import Compose
from PIL import Image
image_path="C:\\temp\\pic"
transforms_list = Compose([#vision.Decode(),
#vision.ToPIL(),
vision.Decode(to_pil=True),
vision.Resize((200, 200))])
#transforms_list = Compose([vision.ToPIL(),vision.Resize((200, 200))])
# pipeline mode
dataset = ds.ImageFolderDataset(image_path, shuffle=False)
dataset = dataset.map(operations=transforms_list, input_columns="image")
for item in dataset.create_dict_iterator():
print(item["image"].shape)
print(item["image"].dtype)
break
# eager mode
transforms_list1 = Compose([vision.ToPIL(),vision.Resize((200, 200))])
#img = np.fromfile("C:\\temp\\pic\\01\\000000000064.jpg",np.uint8)
img = Image.open("C:\\temp\\pic\\01\\000000000064.jpg")
img = np.array(img)
img = transforms_list1(img)[0]
assert isinstance(img, Image.Image)
print(type(img))
