理解tf.nn.depthwise_conv2d

import tensorflow as tf import numpy as np import os os.environ['TF_CPP_MIN_LOG_LEVEL'] = '1' tf.compat.v1.logging.set_verbosity(tf.compat.v1.logging.ERROR) np.random.seed(2020) print('tf.__version__', tf.__version__) def get_data_batch(): bs = 2 h = 3 w = 3 c = 4 x_np = np.random.rand(bs, h, w, c) x_np = x_np.astype(np.float32) print('x_np.shape', x_np.shape) return x_np def run_conv_dw(): print('='*60) x_np = get_data_batch() in_channels = x_np.shape[-1] kernel_size = 3 channel_multiplier = 1 with tf.Session() as sess: x_tf = tf.convert_to_tensor(x_np) filter = tf.get_variable('w1', [kernel_size, kernel_size, in_channels, channel_multiplier], initializer=tf.contrib.layers.xavier_initializer()) z_tf = tf.nn.depthwise_conv2d(x_tf, filter=filter, strides=[1, 1, 1, 1], padding='SAME') sess.run(tf.global_variables_initializer()) z_np = sess.run(fetches=[z_tf], feed_dict={x_tf: x_np})[0] print('z_np.shape', z_np.shape) if '__main__' == __name__: run_conv_dw()

1条回答

网友

1楼 · 发布于 2024-05-16 07:09:43

用英语来说：

每组始终有一个输入通道，“通道\倍增器”输出每组通道数
不是一步到位
见1

我看到了一种方法来模拟每个组的几个输入通道。对于两张，执行depthwise_conv2d，然后将结果张量作为一副牌对半分割，然后将获得的两半元素相加（在relu等之前）。注意，输入通道号i将与i+inputs/2一分组

编辑：上面的技巧对于小的组很有用，对于大的组，只需将输入张量拆分为N个部分，其中N是组计数，分别对每个部分进行conv2d，然后连接结果

相关问题更多 >

编程相关推荐

热门问题

热门文章