PLSRegression#

class sklearn.cross_decomposition.PLSRegression(n_components=2, *, scale=True, max_iter=500, tol=1e-06, copy=True)[source]#

PLS 回归。

PLSRegression is also known as PLS2 or PLS1, depending on the number of targets.

有关其他交叉分解算法的比较，请参阅比较交叉分解方法。

在用户指南中阅读更多内容。

在版本 0.8 中添加。

参数:

n_componentsint, default=2: Number of components to keep. Should be in [1, n_features].
scalebool, default=True: 是否对 X 和 y 进行缩放。
max_iterint, default=500: The maximum number of iterations of the power method when algorithm='nipals'. Ignored otherwise.
tolfloat, default=1e-06: 在幂迭代法中用作收敛标准的容差：当 u_i - u_{i-1} 的平方范数小于 tol 时，算法停止，其中 u 对应于左奇异向量。
copy布尔值, 默认为 True: Whether to copy X and y in fit before applying centering, and potentially scaling. If False, these operations will be done inplace, modifying both arrays.

属性:

x_weights_ndarray of shape (n_features, n_components): 每次迭代的交叉协方差矩阵的左奇异向量。
y_weights_ndarray of shape (n_targets, n_components): 每次迭代的交叉协方差矩阵的右奇异向量。
x_loadings_ndarray of shape (n_features, n_components): X 的载荷。
y_loadings_ndarray of shape (n_targets, n_components): y 的载荷。
x_scores_ndarray of shape (n_samples, n_components): The transformed training samples.
y_scores_ndarray of shape (n_samples, n_components): The transformed training targets.
x_rotations_ndarray of shape (n_features, n_components): 用于转换 X 的投影矩阵。
y_rotations_ndarray of shape (n_targets, n_components): 用于转换 y 的投影矩阵。
coef_ndarray of shape (n_target, n_features): 线性模型的系数，使得 y 近似为 y = X @ coef_.T + intercept_。
intercept_ndarray of shape (n_targets,): 线性模型的截距，使得 y 近似为 y = X @ coef_.T + intercept_。

版本 1.1 中新增。
n_iter_list of shape (n_components,): Number of iterations of the power method, for each component.
n_features_in_int: 在拟合期间看到的特征数。
feature_names_in_shape 为 (n_features_in_,) 的 ndarray: 在 fit 期间看到的特征名称。仅当 X 具有全部为字符串的特征名称时才定义。

1.0 版本新增。

另请参阅

PLSCanonical: Partial Least Squares 转换器和回归器。

示例

>>> from sklearn.cross_decomposition import PLSRegression
>>> X = [[0., 0., 1.], [1.,0.,0.], [2.,2.,2.], [2.,5.,4.]]
>>> y = [[0.1, -0.2], [0.9, 1.1], [6.2, 5.9], [11.9, 12.3]]
>>> pls2 = PLSRegression(n_components=2)
>>> pls2.fit(X, y)
PLSRegression()
>>> y_pred = pls2.predict(X)

For a comparison between PLS Regression and PCA, see Principal Component Regression vs Partial Least Squares Regression.

fit(X, y)[source]#

用数据拟合模型。

参数:

Xshape 为 (n_samples, n_features) 的 array-like: 训练向量，其中 n_samples 是样本数，n_features 是预测变量数。
yshape 为 (n_samples,) 或 (n_samples, n_targets) 的 array-like: 目标向量，其中 n_samples 是样本数，n_targets 是响应变量数。

返回:

selfobject: 拟合的模型。

fit_transform(X, y=None)[source]#

学习并应用训练数据的降维。

参数:

Xshape 为 (n_samples, n_features) 的 array-like: 训练向量，其中 n_samples 是样本数，n_features 是预测变量数。
yarray-like of shape (n_samples, n_targets), default=None: 目标向量，其中 n_samples 是样本数，n_targets 是响应变量数。

返回:

selfndarray of shape (n_samples, n_components): 如果未给定 y，则返回 x_scores，否则返回 (x_scores, y_scores)。

get_feature_names_out(input_features=None)[source]#

获取转换的输出特征名称。

The feature names out will prefixed by the lowercased class name. For example, if the transformer outputs 3 features, then the feature names out are: ["class_name0", "class_name1", "class_name2"].

参数:

input_featuresarray-like of str or None, default=None: Only used to validate feature names with the names seen in fit.

返回:

feature_names_outstr 对象的 ndarray: 转换后的特征名称。

get_metadata_routing()[source]#

获取此对象的元数据路由。

请查阅用户指南，了解路由机制如何工作。

返回:

routingMetadataRequest: 封装路由信息的 MetadataRequest。

get_params(deep=True)[source]#

获取此估计器的参数。

参数:

deepbool, default=True: 如果为 True，将返回此估计器以及包含的子对象（如果它们是估计器）的参数。

返回:

paramsdict: 参数名称映射到其值。

inverse_transform(X, y=None)[source]#

将数据转换回其原始空间。

参数:

Xarray-like of shape (n_samples, n_components): 新数据，其中 n_samples 是样本数，n_components 是 pls 组件数。
yarray-like of shape (n_samples,) or (n_samples, n_components): 新目标，其中 n_samples 是样本数，n_components 是 pls 组件数。

返回:

X_original形状为 (n_samples, n_features) 的 ndarray: 返回重建的 X 数据。
y_originalndarray of shape (n_samples, n_targets): 返回重建的 X 目标。仅在给定 y 时返回。

注意事项

此转换仅在 n_components=n_features 时才完全准确。

predict(X, copy=True)[source]#

预测给定样本的目标。

参数:

Xshape 为 (n_samples, n_features) 的 array-like: 样本。
copy布尔值, 默认为 True: 是否复制 X 或执行就地归一化。

返回:

y_predndarray of shape (n_samples,) or (n_samples, n_targets): 返回预测值。

注意事项

此调用需要估计形状为 (n_features, n_targets) 的矩阵，这在高维空间中可能是一个问题。

score(X, y, sample_weight=None)[source]#

返回测试数据的决定系数。

The coefficient of determination, $R^2$, is defined as $(1 - \frac{u}{v})$, where $u$ is the residual sum of squares ((y_true - y_pred)** 2).sum() and $v$ is the total sum of squares ((y_true - y_true.mean()) ** 2).sum(). The best possible score is 1.0 and it can be negative (because the model can be arbitrarily worse). A constant model that always predicts the expected value of y, disregarding the input features, would get a $R^2$ score of 0.0.

参数:

Xshape 为 (n_samples, n_features) 的 array-like: 测试样本。对于某些估计器，这可能是一个预先计算的核矩阵或一个通用对象列表，形状为 (n_samples, n_samples_fitted)，其中 n_samples_fitted 是用于估计器拟合的样本数。
yshape 为 (n_samples,) 或 (n_samples, n_outputs) 的 array-like: X 的真实值。
sample_weightshape 为 (n_samples,) 的 array-like, default=None: 样本权重。

返回:

scorefloat: self.predict(X) 相对于 y 的 $R^2$。

注意事项

The $R^2$ score used when calling score on a regressor uses multioutput='uniform_average' from version 0.23 to keep consistent with default value of r2_score. This influences the score method of all the multioutput regressors (except for MultiOutputRegressor).

set_output(*, transform=None)[source]#

设置输出容器。

有关如何使用 API 的示例，请参阅引入 set_output API。

参数:

transform{“default”, “pandas”, “polars”}, default=None

配置 transform 和 fit_transform 的输出。

"default": 转换器的默认输出格式
"pandas": DataFrame 输出
"polars": Polars 输出
None: 转换配置保持不变

1.4 版本新增: 添加了 "polars" 选项。

返回:

selfestimator instance: 估计器实例。

set_params(**params)[source]#

设置此估计器的参数。

此方法适用于简单的估计器以及嵌套对象（如 Pipeline）。后者具有 <component>__<parameter> 形式的参数，以便可以更新嵌套对象的每个组件。

参数:

**paramsdict: 估计器参数。

返回:

selfestimator instance: 估计器实例。

set_predict_request(*, copy: bool | None | str = '$UNCHANGED$') → PLSRegression[source]#

配置是否应请求元数据以传递给 predict 方法。

请注意，此方法仅在以下情况下相关：此估计器用作元估计器中的子估计器，并且通过 enable_metadata_routing=True 启用了元数据路由（请参阅 sklearn.set_config）。请查看用户指南以了解路由机制的工作原理。

每个参数的选项如下：

True：请求元数据，如果提供则传递给 predict。如果未提供元数据，则忽略该请求。
False：不请求元数据，并且元估计器不会将其传递给 predict。
None：不请求元数据，如果用户提供元数据，元估计器将引发错误。
str：应将元数据以给定别名而不是原始名称传递给元估计器。

默认值 (sklearn.utils.metadata_routing.UNCHANGED) 保留现有请求。这允许您更改某些参数的请求而不更改其他参数。

在版本 1.3 中新增。

参数:

copystr, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED: predict 中 copy 参数的元数据路由。

返回:

selfobject: 更新后的对象。

set_score_request(*, sample_weight: bool | None | str = '$UNCHANGED$') → PLSRegression[source]#

配置是否应请求元数据以传递给 score 方法。

请注意，此方法仅在以下情况下相关：此估计器用作元估计器中的子估计器，并且通过 enable_metadata_routing=True 启用了元数据路由（请参阅 sklearn.set_config）。请查看用户指南以了解路由机制的工作原理。

每个参数的选项如下：

True：请求元数据，如果提供则传递给 score。如果未提供元数据，则忽略该请求。
False：不请求元数据，元估计器不会将其传递给 score。
None：不请求元数据，如果用户提供元数据，元估计器将引发错误。
str：应将元数据以给定别名而不是原始名称传递给元估计器。

默认值 (sklearn.utils.metadata_routing.UNCHANGED) 保留现有请求。这允许您更改某些参数的请求而不更改其他参数。

在版本 1.3 中新增。

参数:

sample_weightstr, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED: score 方法中 sample_weight 参数的元数据路由。

返回:

selfobject: 更新后的对象。

set_transform_request(*, copy: bool | None | str = '$UNCHANGED$') → PLSRegression[source]#

配置是否应请求元数据以传递给 transform 方法。

请注意，此方法仅在以下情况下相关：此估计器用作元估计器中的子估计器，并且通过 enable_metadata_routing=True 启用了元数据路由（请参阅 sklearn.set_config）。请查看用户指南以了解路由机制的工作原理。

每个参数的选项如下：

True：请求元数据，如果提供则传递给 transform。如果未提供元数据，则忽略该请求。
False：不请求元数据，并且元估计器不会将其传递给 transform。
None：不请求元数据，如果用户提供元数据，元估计器将引发错误。
str：应将元数据以给定别名而不是原始名称传递给元估计器。

默认值 (sklearn.utils.metadata_routing.UNCHANGED) 保留现有请求。这允许您更改某些参数的请求而不更改其他参数。

在版本 1.3 中新增。

参数:

copystr, True, False, or None, default=sklearn.utils.metadata_routing.UNCHANGED: transform 中 copy 参数的元数据路由。

返回:

selfobject: 更新后的对象。

transform(X, y=None, copy=True)[source]#

应用降维。

参数:

Xshape 为 (n_samples, n_features) 的 array-like: 要转换的样本。
yarray-like of shape (n_samples, n_targets), default=None: 目标向量。
copy布尔值, 默认为 True: 是否复制 X 和 y，或执行就地归一化。

返回:

x_scores, y_scoresarray-like or tuple of array-like: 如果未给定 y，则返回 x_scores，否则返回 (x_scores, y_scores)。

Gallery examples#

比较交叉分解方法

主成分回归 vs 偏最小二乘回归

PLSRegression#

Gallery examples#

本页