How do I train a neural network while keeping some weights and biases fixed?

artificial-intelligence neural-network tensorflow keras weights
2021-10-20 09:55:12

I am a beginner with neural networks. I am building a neural network with 3 layers. The input X has 7 features and the output Y is a single real number. The hidden layer has two nodes, and the bottom node contains weights and a bias term that should be hard-set (kept fixed).

(Figure: the sample neural network to be trained; the red weights and the bias term b2 are the ones that should be hard-set.)

Now I want to train this neural network on the training data X and Y so that the red weights stay fixed while all other weights remain learnable.

Is there a way to do this during training? I am using TensorFlow and Keras, so it would be very helpful if you could also provide the code needed to do this.

1 Answer

The strategy is to turn b2 into a separate model, initialise b2 the way it should be, and train your network as usual without b2. Because b2 is not an attribute of the main model, its variables are not in the main model's list of trainable variables, so the optimizer never updates them.

In the middle of the main network, combine the output of the b1 layer with the output of the b2 network using a concatenation, e.g. in TensorFlow:

# Axis 0 is the batch dimension, axis 1 is the feature dimension within each sample
tf.concat([b1out,b2out],axis=1)
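
A minimal sketch of the shapes involved (the 200-unit width matches the example below; the tensors here are placeholders for illustration):

import tensorflow as tf

b1out = tf.ones([2, 200])    # output of the trainable branch for a batch of 2 samples
b2out = tf.zeros([2, 200])   # output of the fixed b2 branch for the same batch
h1    = tf.concat([b1out, b2out], axis=1)
print(h1.shape)              # (2, 400): the per-sample features are stacked side by side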

Example source code (paste into Google Colab to test it):

%tensorflow_version 2.x
%reset -f

import tensorflow                   as     tf
from   tensorflow.keras             import *
from   tensorflow.keras.layers      import *
from   tensorflow.keras.activations import *
from   tensorflow.keras.models      import *
from   tensorflow.keras.callbacks   import *

class MyModel(Model):
    def __init__(self):
        super(MyModel,self).__init__()
        self.dense1 = Dense(200, activation=relu)
        self.dense2 = Dense(1,   activation=tf.identity)

    @tf.function
    def call(self,x):
        # MIND THE CONCATENATION IN THIS PART:
        h1a = self.dense1(x)
        h1b = b2(x) # b2 is the global ModelB2 instance defined below; it is not an attribute of MyModel, so it won't get trained and stays fixed

        # CONCATENATION ALONG AXIS 1, BECAUSE AXIS 0 IS THE BATCH DIMENSION
        h1  = tf.concat([h1a,h1b],axis=1) 
        u   = self.dense2(h1)
        return u

class ModelB2(Model):
    def __init__(self):
        super(ModelB2,self).__init__()
        self.dense1 = Dense(200)
        # self.dense2 = ...
        # Initialise / hard-set the weights of b2 here, or use a pre-trained model

    @tf.function
    def call(self,x):
        u = self.dense1(x)
        return u

# PROGRAMME ENTRY POINT ========================================================
if __name__=="__main__":
    inp = [[1,2,3,4,5,6,7],[7,6,5,4,3,2,1]] # Example values
    exp = [[0],            [1]            ]

    mm = MyModel()
    b2 = ModelB2()

    mm.compile(loss=tf.losses.MeanSquaredError(), optimizer=tf.optimizers.Adam(1e-3))
    b2.compile(loss=tf.losses.MeanSquaredError(), optimizer=tf.optimizers.Adam(1e-3))
    mm_loss = mm.evaluate(x=inp,y=exp, batch_size=len(inp), steps=1) # evaluate builds the model and initialises its weights
    b2_loss = b2.evaluate(x=inp,y=exp, batch_size=len(inp), steps=1) # same here for the fixed b2 model

    print("\nbefore training:")
    print("mm weights:",mm.get_weights()[0][0][:3],"...")
    print("b2 weights:",b2.get_weights()[0][0][:3],"...")
    print("mm loss:",mm_loss)
    print("b2 loss:",b2_loss)

    mm.fit(x=inp,y=exp, batch_size=len(inp), epochs=500, verbose=0)
    mm_loss = mm.evaluate(x=inp,y=exp, batch_size=len(inp), steps=1) 
    b2_loss = b2.evaluate(x=inp,y=exp, batch_size=len(inp), steps=1) 

    print("\nafter training:")
    print("mm weights:",mm.get_weights()[0][0][:3],"...")
    print("b2 weights:",b2.get_weights()[0][0][:3],"... <-- UNCHANGED AS WANTED")
    print("mm loss:",mm_loss)
    print("b2 loss:",b2_loss,"<-- UNCHANGED AS WANTED")
# EOF

Google Colab: https://colab.research.google.com
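
A related alternative, when the weights to be kept fixed form an entire Keras layer, is to freeze that layer by setting its trainable attribute to False before compiling; the optimizer then skips its variables entirely. A minimal sketch using the functional API (layer names and sizes here are illustrative, not taken from the code above):

import tensorflow as tf
from tensorflow.keras.layers import Dense, Input, Concatenate
from tensorflow.keras.models import Model

inputs      = Input(shape=(7,))                              # 7 input features, as in the question
trainable_h = Dense(200, activation="relu")(inputs)          # branch whose weights are learned
fixed_layer = Dense(200, activation="relu", name="fixed")    # branch whose weights should stay fixed
fixed_layer.trainable = False                                # freeze before compiling
fixed_h     = fixed_layer(inputs)
hidden      = Concatenate(axis=1)([trainable_h, fixed_h])    # same concatenation idea as above
outputs     = Dense(1)(hidden)

model = Model(inputs, outputs)
model.compile(loss="mse", optimizer="adam")                  # frozen weights are excluded from gradient updates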