Matern#

class sklearn.gaussian_process.kernels.Matern(length_scale=1.0, length_scale_bounds=(1e-05, 100000.0), nu=1.5)[source]#

Matern 核。

Matern 核类是 RBF 核的推广。它有一个额外的参数 \(\nu\)，用于控制结果函数的平滑度。 \(\nu\) 越小，近似函数越不平滑。当 \(\nu\rightarrow\infty\) 时，核函数等同于 RBF 核。当 \(\nu = 1/2\) 时，Matérn 核与绝对指数核完全相同。重要的中间值包括 \(\nu=1.5\)（一次可微分函数）和 \(\nu=2.5\)（两次可微分函数）。

该核由下式给出

\[k(x_i, x_j) = \frac{1}{\Gamma(\nu)2^{\nu-1}}\Bigg( \frac{\sqrt{2\nu}}{l} d(x_i , x_j ) \Bigg)^\nu K_\nu\Bigg( \frac{\sqrt{2\nu}}{l} d(x_i , x_j )\Bigg)\]

其中 \(d(\cdot,\cdot)\) 是欧几里得距离，\(K_{\nu}(\cdot)\) 是修正的贝塞尔函数，\(\Gamma(\cdot)\) 是伽马函数。有关 Matérn 核不同变体的详细信息，请参见[1]，第4章，第4.2节。

在用户指南中阅读更多内容。

版本 0.18 新增。

参数:

length_scalefloat 或 shape (n_features,) 的 ndarray，默认值=1.0: 核的长度尺度。如果为 float，则使用各向同性核。如果为 array，则使用各向异性核，其中 l 的每个维度定义相应特征维度的长度尺度。
length_scale_bounds一对 float >= 0 或 “fixed”，默认值=(1e-5, 1e5): “length_scale” 的下限和上限。如果设置为 “fixed”，则在超参数调整期间不能更改 “length_scale”。
nufloat，默认值=1.5: 控制学习函数平滑度的参数 nu。nu 越小，近似函数越不平滑。对于 nu=inf，该核等同于 RBF 核；对于 nu=0.5，该核等同于绝对指数核。重要的中间值是 nu=1.5（一次可微分函数）和 nu=2.5（两次可微分函数）。请注意，不在 [0.5, 1.5, 2.5, inf] 中的 nu 值会产生相当高的计算成本（大约高出 10 倍），因为它们需要评估修正的贝塞尔函数。此外，与 l 不同，nu 保持固定为其初始值，不进行优化。

References

[1]

Carl Edward Rasmussen, Christopher K. I. Williams (2006). “Gaussian Processes for Machine Learning”. The MIT Press.

示例

>>> from sklearn.datasets import load_iris
>>> from sklearn.gaussian_process import GaussianProcessClassifier
>>> from sklearn.gaussian_process.kernels import Matern
>>> X, y = load_iris(return_X_y=True)
>>> kernel = 1.0 * Matern(length_scale=1.0, nu=1.5)
>>> gpc = GaussianProcessClassifier(kernel=kernel,
...         random_state=0).fit(X, y)
>>> gpc.score(X, y)
0.9866
>>> gpc.predict_proba(X[:2,:])
array([[0.8513, 0.0368, 0.1117],
        [0.8086, 0.0693, 0.1220]])

__call__(X, Y=None, eval_gradient=False)[source]#

返回核 k(X, Y) 及其可选的梯度。

参数:

Xndarray of shape (n_samples_X, n_features): 返回的核 k(X, Y) 的左参数
Yndarray of shape (n_samples_Y, n_features), default=None: 返回的核 k(X, Y) 的右参数。如果为 None，则改为计算 k(X, X)。
eval_gradientbool, default=False: 确定是否计算关于核超参数对数的梯度。仅当 Y 为 None 时支持。

返回:

Kndarray of shape (n_samples_X, n_samples_Y): 核 k(X, Y)
K_gradientndarray of shape (n_samples_X, n_samples_X, n_dims), optional: 核 k(X, X) 关于核超参数对数的梯度。仅当 eval_gradient 为 True 时返回。

property bounds#

返回 theta 的对数变换边界。

返回:

boundsndarray of shape (n_dims, 2): 核超参数 theta 的对数变换边界

clone_with_theta(theta)[source]#

返回 self 的克隆，具有给定的超参数 theta。

参数:

thetandarray of shape (n_dims,): 超参数

diag(X)[source]#

返回核 k(X, X) 的对角线。

此方法的结果与 np.diag(self(X)) 相同；然而，由于只评估对角线，它可以更有效地评估。

参数:

Xndarray of shape (n_samples_X, n_features): 返回的核 k(X, Y) 的左参数

返回:

K_diagndarray of shape (n_samples_X,): 核 k(X, X) 的对角线

get_params(deep=True)[source]#

获取此核的参数。

参数:

deepbool, default=True: 如果为 True，将返回此估计器以及包含的子对象（如果它们是估计器）的参数。

返回:

paramsdict: 参数名称映射到其值。

property hyperparameters#: 返回所有超参数规范的列表。

is_stationary()[source]#: 返回核是否平稳。

property n_dims#: 返回核的非固定超参数的数量。

property requires_vector_input#: 返回核是定义在固定长度特征向量上还是定义在一般对象上。为了向后兼容，默认为 True。

set_params(**params)[source]#

设置此核的参数。

此方法适用于简单核以及嵌套核。后者具有 <component>__<parameter> 形式的参数，因此可以更新嵌套对象的每个组件。

返回:

self

property theta#

返回（展平的、对数变换的）非固定超参数。

请注意，theta 通常是核超参数的对数变换值，因为这种搜索空间的表示形式更适合超参数搜索，因为像长度尺度这样的超参数自然存在于对数尺度上。

返回:

thetandarray of shape (n_dims,): 核的非固定、对数变换超参数

Gallery examples#

不同核的先验和后验高斯过程图示

Matern#

Gallery examples#

本页