打印张量

问题描述

我有一个映射到生成器的增强函数；但是，由于某种原因，tfa.image.rotate函数会导致错误。

def customGenerator(input_file_paths,dims,data_type):
    for i,file_path in enumerate(input_file_paths):
        if data_type.decode("utf-8") in ["png" or "tif"]:
            img = plt.imread((file_path.decode("utf-8")))
        elif data_type.decode("utf-8") == "npy":
            img = np.load(file_path.decode("utf-8"))
        x = resize(img[:,:,:3],dims)           
        yield x,x

def augment(image,label) :
    print('image',image)
    print('shape',image.shape)
    print('type',type(image))

    #angle = random.uniform(0,tf.constant(np.pi))
    image = tfa.image.rotate(image,tf.constant(np.pi)


train_dataset = tf.data.Dataset.from_generator(generator=customGenerator,output_types=(np.float32,np.float32),output_shapes=(dims,dims),args=[X_train_paths,"png"])

train_dataset = train_dataset.map(augment,num_parallel_calls=AUTOTUNE)

我查看了其他人使用的tfa.image.rotate的实现，他们的工作正常。我尝试在增强函数中打印image变量。结果是：

print('image',image) # these lines is in the augment function,result below
print('type',type(image))

image Tensor("args_0:0",shape=(256,256,3),dtype=float32)
type <class 'tensorflow.python.framework.ops.Tensor'>

相反，当我转到其他用户的实现并打印其图像时，该图像未映射到数据集中。他们的print(image)和print(type(image))打印出以下内容：

image tf.Tensor(
[[[1. 1. 1.]
  [1. 1. 1.]
  [1. 1. 1.]
  ...
  [1. 1. 1.]
  [1. 1. 1.]
  [1. 1. 1.]]

 ...

 [[1. 1. 1.]
  [1. 1. 1.]
  [1. 1. 1.]
  ...
  [1. 1. 1.]
  [1. 1. 1.]
  [1. 1. 1.]]],dtype=float32)

type <class 'tensorflow.python.framework.ops.EagerTensor'>

我希望在image函数中打印 augment时能打印出来。所以我不确定会发生什么。有几个问题。此外，tf.executing_eagerly()的结果为True

"args_0:0"到底是什么意思？

augment函数中的图像应该是<class 'tensorflow.python.framework.ops.EagerTensor'>类型，而不是普通张量吗？

我是否可以通过某种方法将"args_0:0"设置为我期望的打印数字数组的格式？因为我相信这会解决旋转功能

最后，如果没有，是否有更好的方法通过随机旋转来增强图像？

感谢您的时间和帮助。

解决方法

args_0:0是张量。参见here

为了使代码生效，我对您的代码进行了一些更改。

代码：

import tensorflow_addons as tfa
import os
import matplotlib.pyplot as plt
import tensorflow as tf
import numpy as np


def customGenerator(input_file_paths,dims,data_type):
    for i,file_path in enumerate(input_file_paths):
        image = tf.io.read_file(file_path)
        image = tf.image.decode_png(image,channels = 3)
        image = tf.image.convert_image_dtype(image,tf.float32)
        image = tf.image.resize(image,[dims[0],dims[1]])
        yield image,image

def augment(image,label) :
    img = tfa.image.rotate(image,tf.constant(np.pi/8))   
    return (img,label)

X_train_paths = [os.path.join('data','img',name) for name in os.listdir('data/img')]
dims = (256,256,3)

train_dataset = tf.data.Dataset.from_generator(generator=customGenerator,output_types=(tf.float32,tf.float32),output_shapes=(dims,dims),args=[X_train_paths,"png"])

train_dataset = train_dataset.map(augment)

遍历数据集：

for images in train_dataset:
    rotatedimg,normalimg= images[0],images[1]
    break

输出：

plt.imshow(rotatedimg)

plt.imshow(normalimg)

要记住的事情：

地图功能未急切执行。
始终在生成器函数和map函数内使用TensorFlow函数，因为TensorFlow将这些函数作为图形的一部分执行，以加快其执行速度。参见here
如果您使用其他功能，TensorFlow可能无法将这些操作转换为图形，这将导致错误。

data-augmentation image-rotation python tensorflow tensorflow-datasets

打印张量

问题描述

解决方法

相关问答