数据挖掘 - 什么是自定义 SVM 内核？ - 吾爱随笔录

什么是自定义 SVM 内核？

数据挖掘机器学习分类机器学习模型支持向量机核心

2022-02-19 09:21:03

什么是支持向量机中的自定义内核。它与多项式内核有何不同。如何实现自定义内核。您能否提供代码来实现自定义内核。

1个回答

自定义内核可以是任何用户定义的函数，它可以转换训练数据集，以便可以将非线性边界转换为更高维度的线性边界。

多项式内核只是我们也是 RBF、Sigmoid、线性、高斯和其他内核的一种内核。每个内核都有一些属性。

多项式内核：它表示特征空间中训练数据集中的向量与内核中使用的原始变量的多项式的相似性。

自定义内核的代码可以在 skicit learn文档中找到：

import numpy as np
import matplotlib.pyplot as plt
from sklearn import svm, datasets

# import some data to play with
iris = datasets.load_iris()
X = iris.data[:, :2]  # we only take the first two features. We could
# avoid this ugly slicing by using a two-dim dataset
Y = iris.target


def my_kernel(X, Y):
    """
    We create a custom kernel:

                 (2  0)
    k(X, Y) = X  (    ) Y.T
                 (0  1)
    """
    M = np.array([[2, 0], [0, 1.0]])
    return np.dot(np.dot(X, M), Y.T)


h = 0.02  # step size in the mesh

# we create an instance of SVM and fit out data.
clf = svm.SVC(kernel=my_kernel)
clf.fit(X, Y)

# Plot the decision boundary. For that, we will assign a color to each
# point in the mesh [x_min, x_max]x[y_min, y_max].
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, h), np.arange(y_min, y_max, h))
Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])

# Put the result into a color plot
Z = Z.reshape(xx.shape)
plt.pcolormesh(xx, yy, Z, cmap=plt.cm.Paired)

# Plot also the training points
plt.scatter(X[:, 0], X[:, 1], c=Y, cmap=plt.cm.Paired, edgecolors="k")
plt.title("3-Class classification using Support Vector Machine with custom kernel")
plt.axis("tight")
plt.show()

其它你可能感兴趣的问题

上一篇如何处理大量未标记的目标数据集？下一篇分类模型需要多少基本事实？