keras进阶，我从Layer开始之七

工程师milter 2020-08-20

1516

今天继续读keras中的Layer源码。我推荐你打开源码和我同步阅读。

293-400

def assert_input_compatibility(self, inputs):
        """Checks compatibility between the layer and provided inputs.

        This checks that the tensor(s) `input`
        verify the input assumptions of the layer
        (if any). If not, exceptions are raised.

        # Arguments
            inputs: input tensor or list of input tensors.

        # Raises
            ValueError: in case of mismatch between
                the provided inputs and the expectations of the layer.
        """
        inputs = to_list(inputs)
        for x in inputs:
            try:
                K.is_keras_tensor(x)
            except ValueError:
                raise ValueError('Layer ' + self.name + ' was called with '
                                 'an input that isn\'t a symbolic tensor. '
                                 'Received type: ' +
                                 str(type(x)) + '. Full input: ' +
                                 str(inputs) + '. All inputs to the layer '
                                 'should be tensors.')

        if not self.input_spec:
            return
        if not isinstance(self.input_spec, (list, tuple)):
            input_spec = to_list(self.input_spec)
        else:
            input_spec = self.input_spec
        if len(inputs) != len(input_spec):
            raise ValueError('Layer ' + self.name + ' expects ' +
                             str(len(input_spec)) + ' inputs, '
                             'but it received ' + str(len(inputs)) +
                             ' input tensors. Input received: ' +
                             str(inputs))
        for input_index, (x, spec) in enumerate(zip(inputs, input_spec)):
            if spec is None:
                continue

            # Check ndim.
            if spec.ndim is not None:
                if K.ndim(x) != spec.ndim:
                    raise ValueError('Input ' + str(input_index) +
                                     ' is incompatible with layer ' +
                                     self.name + ': expected ndim=' +
                                     str(spec.ndim) + ', found ndim=' +
                                     str(K.ndim(x)))
            if spec.max_ndim is not None:
                ndim = K.ndim(x)
                if ndim is not None and ndim > spec.max_ndim:
                    raise ValueError('Input ' + str(input_index) +
                                     ' is incompatible with layer ' +
                                     self.name + ': expected max_ndim=' +
                                     str(spec.max_ndim) + ', found ndim=' +
                                     str(K.ndim(x)))
            if spec.min_ndim is not None:
                ndim = K.ndim(x)
                if ndim is not None and ndim < spec.min_ndim:
                    raise ValueError('Input ' + str(input_index) +
                                     ' is incompatible with layer ' +
                                     self.name + ': expected min_ndim=' +
                                     str(spec.min_ndim) + ', found ndim=' +
                                     str(K.ndim(x)))
            # Check dtype.
            if spec.dtype is not None:
                if K.dtype(x) != spec.dtype:
                    raise ValueError('Input ' + str(input_index) +
                                     ' is incompatible with layer ' +
                                     self.name + ': expected dtype=' +
                                     str(spec.dtype) + ', found dtype=' +
                                     str(K.dtype(x)))
            # Check specific shape axes.
            if spec.axes:
                try:
                    x_shape = K.int_shape(x)
                except TypeError:
                    x_shape = None
                if x_shape is not None:
                    for axis, value in spec.axes.items():
                        if (value is not None and
                                x_shape[int(axis)] not in {value, None}):
                            raise ValueError(
                                'Input ' + str(input_index) +
                                ' is incompatible with layer ' +
                                self.name + ': expected axis ' +
                                str(axis) + ' of input shape to have '
                                'value ' + str(value) +
                                ' but got shape ' + str(x_shape))
            # Check shape.
            if spec.shape is not None:
                try:
                    x_shape = K.int_shape(x)
                except TypeError:
                    x_shape = None
                if x_shape is not None:
                    for spec_dim, dim in zip(spec.shape, x_shape):
                        if spec_dim is not None and dim is not None:
                            if spec_dim != dim:
                                raise ValueError(
                                    'Input ' + str(input_index) +
                                    ' is incompatible with layer ' +
                                    self.name + ': expected shape=' +
                                    str(spec.shape) + ', found shape=' +
                                    str(x_shape))

在assert_input_compatibility中，检查完tensor后，进入shape的检查。每一个tensor对应一个input_spec，如果对应不上，就会报错。

看到首先检查的是tensor的维度数，即ndim。其次是 max_ndim和min_ndim。

维度检查完毕，就开始检查数据类型。360-366

继续，检查每一个axis的维度数。368-383 这里的检查是比较宽松的，因为允许维度是None。

紧接着的shape检查就会要求二者严格相等了。385-399

看起来，spec.shape和spec.axes更像是在版本迭代中重叠起来的。为了兼容，一直保留了下来。

401-412

这里是call方法，留给具体的Layer进行实现。其实就是模板设计模式。我们自定义自己的Layer时，通过重写这个方法实现自己的计算逻辑。

413-540

这里是call方法的包装方法，用来处理预处理一些keras的记录信息。

读一下注释，可以发现这个包装方法主要完成四项功能：

调用_add_inbound_node()方法

这里想进一步理解一下inbound_node这个概念。我们的layer创建好后，可以多次调用，如下所示：

myLayer = MyLayer()
o1 = myLayer(inputs1)
o2 = myLayer(inputs2)

inputs1和inputs2是input tensor的列表，里面的每个input都来自某一个上游的Layer的output。但是每个列表中的input未必来自同一个Layer的output。

在上面，由于我们调用了两次myLayer。所以，在myLayer的inbound_nodes数组中就会添加两个node，来分别记录这两次调用的信息。具体记录哪些信息，就在这个包装的__call__方法中。

如果layer还没有built，会进行build方法的调用
更新_keras_shape，这个需要结合代码进行理解
对每一个output tensor, 更新它的_keras_history。这个在上篇文章中已经讲过。每当产生一个新的tensor，及时确定它的三维坐标（layer，node_index，output_tensor_index)

这段注释，我觉得写的非常好。值得学习。提纲挈领地说明了这个方法的内容。这样，在下面的阅读中，就不会迷失在浩繁的细节中。