因此,我一直在尝试将我的数据集拆分为 70-30 的比例train_test_split,以便使用sklearnPLS 解决问题。但是,我遇到了一个错误,上面写着:
# Read data
df = pd.read_csv(
filepath_or_buffer='colontumor.csv',
header=None,
sep=',')
df.tail()
X = df.iloc[:, :-1].values
# Split data to train and test on 70-30 ratio
X_train, X_test = train_test_split(
X, test_size=0.30, random_state=None)
pls = PLSRegression(n_components=2)
pls.fit(X_train, X_test)
Y_pred = pls2.predict(X_train)
并以某种方式遇到了这个错误:
Line (17): pls.fit(X_train, X_test)
ValueError: Found input variables with inconsistent numbers of samples: [43, 19]
有什么解决办法吗?我已经转了好几个小时了。