The neural network got a lucky guess. Can it be trusted?
The loss curves suggest that training can be improved by tuning the hyperparameters, especially the learning rate and/or the batch size. So the best decision is to keep refining the model rather than using the one already trained.
If tuning the hyperparameters is not an option, you should at least re-split the training and validation data, or repeat the run with a different random seed.
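A minimal sketch of re-splitting with different seeds, using NumPy; the data `X`, `y` and the helper `resplit` are hypothetical placeholders for your own dataset and pipeline:

```python
import numpy as np

# Hypothetical data: X (features) and y (labels) stand in for your dataset.
X = np.arange(100 * 4).reshape(100, 4)
y = np.arange(100) % 2

def resplit(X, y, val_fraction=0.2, seed=0):
    """Shuffle the indices with a given seed and split into train/validation
    parts. Re-running with different seeds shows whether a result depended
    on one lucky partition of the data."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_val = int(len(X) * val_fraction)
    val_idx, train_idx = idx[:n_val], idx[n_val:]
    return X[train_idx], y[train_idx], X[val_idx], y[val_idx]

# Repeat the split with several different seeds and retrain each time.
for seed in (0, 1, 2):
    X_tr, y_tr, X_val, y_val = resplit(X, y, seed=seed)
    # ... retrain and evaluate the model on this split ...
print(X_tr.shape, X_val.shape)  # (80, 4) (20, 4)
```

If the good validation loss only shows up for one particular split or seed, it was likely luck rather than a genuinely better model.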
If none of these is an option, you should take the checkpoint with the best validation loss. The reason is that we assume the validation data has not leaked into the training data, and that the validation data is representative of the data the model will be tested on. In the absence of any other evidence, we should assume that the "sweet spot" you found will also yield better results on test data the model has never seen.
You should always look at the validation loss; the training loss does not matter when evaluating your model's performance.
Your idea is actually in line with the principle of early stopping: keep training the model, checking its performance at each epoch; whenever you find a new best validation loss, save the model; and stop training once the loss has not improved for a number of epochs defined by your patience hyperparameter.
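The early-stopping loop described above can be sketched framework-agnostically; here the list of per-epoch validation losses stands in for a real training loop, and `train_with_early_stopping` is a hypothetical helper:

```python
import math

def train_with_early_stopping(val_losses, patience=3):
    """Minimal early-stopping loop over a stream of per-epoch validation
    losses. Records the best epoch when the loss improves (this is where
    you would save a checkpoint) and stops once the loss has not improved
    for `patience` consecutive epochs."""
    best_loss = math.inf
    best_epoch = None
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss = loss
            best_epoch = epoch  # in practice: save the model checkpoint here
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break  # patience exhausted: stop training
    return best_epoch, best_loss

# Example: the loss stops improving after epoch 2, so with patience=3
# training halts at epoch 5 and the epoch-2 model is kept.
print(train_with_early_stopping([0.9, 0.7, 0.6, 0.65, 0.64, 0.66, 0.7]))  # (2, 0.6)
```

Most deep-learning libraries ship this as a ready-made callback (e.g. an early-stopping callback keyed on validation loss), so in practice you rarely write the loop by hand.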
However, regarding your specific problem, I agree with @ncasas: looking at your image, it seems your model can still be improved.
