python - sklearn RFECV を使用して、推定器をフィッティングする前に次元削減ステップに渡す最適な機能を選択する方法

翻译自：https://stackoverflow.com/questions/67602546 2021-05-19T11:40:56.980

354 次

KNN を使用して推定器をフィッティングする前に、sklearn RFECV メソッドを使用して、次元削減のために LinearDiscriminantAnalysis(n_components=2) メソッドに渡す最適な機能を選択するにはどうすればよいですか。

pipeline = make_pipeline(Normalizer(), LinearDiscriminantAnalysis(n_components=2), KNeighborsClassifier(n_neighbors=10))

X = self.dataset
y = self.postures

min_features_to_select = 1  # Minimum number of features to consider
rfecv = RFECV(svc, step=1, cv=None, scoring='f1_weighted', min_features_to_select=min_features_to_select)

rfecv.fit(X, y)

print(rfecv.support_)
print(rfecv.ranking_)
print("Optimal number of features : %d" % rfecv.n_features_)

Plot number of features VS. cross-validation scores
plt.figure()
plt.xlabel("Number of features selected")
plt.ylabel("Cross validation score (nb of correct classifications)")
plt.plot(range(min_features_to_select,
len(rfecv.grid_scores_) + min_features_to_select),
rfecv.grid_scores_)
plt.show()

このコードから次のエラーが発生します。このコードを LinearDiscriminantAnalysis() ステップなしで実行すると機能しますが、これは私の処理の重要な部分です。

*** ValueError: when `importance_getter=='auto'`, the underlying estimator Pipeline should have `coef_` or `feature_importances_` attribute. Either pass a fitted estimator to feature selector or call fit before calling transform.

python - sklearn RFECV を使用して、推定器をフィッティングする前に次元削減ステップに渡す最適な機能を選択する方法

1 に答える 1

Related

Reference