Problem description
I am running evalml.automl on my data and created a class column as follows:
df.loc[(df.quality<6),'flag_class'] = 1
df.loc[(df.quality==6),'flag_class'] = 2
df.loc[(df.quality>6),'flag_class'] = 3
Then I split the data as follows:
X = df[['few columns in there']].copy()
y = df[['flag_class']].copy()
X_train,X_holdout,y_train,y_holdout = evalml.preprocessing.split_data(X,y,problem_type = 'multiclass')
Error:
AttributeError Traceback (most recent call last)
<ipython-input-37-dffcb1214932> in <module>
----> 1 X_train,X_holdout,y_train,y_holdout = evalml.preprocessing.split_data(X,y,problem_type = 'multiclass')
~\AppData\Roaming\Python\python38\site-packages\evalml\preprocessing\utils.py in split_data(X,y,problem_type,problem_configuration,test_size,random_seed)
75 data_splitter = StratifiedShuffleSplit(n_splits=1,test_size=test_size,random_state=random_seed)
76
---> 77 train,test = next(data_splitter.split(X.to_dataframe(),y.to_series()))
78
79 X_train = X.iloc[train]
AttributeError: 'DataTable' object has no attribute 'to_series'
Any help would be greatly appreciated. Thanks in advance.
Solution
DataTable is a class from the Woodwork framework, which is built on top of the pandas DataFrame.
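Judging from the traceback, the failing call is y.to_series(), and y is a DataTable rather than a one-dimensional column. A likely cause is that y was selected with double brackets (df[['flag_class']]), which yields a one-column DataFrame instead of a Series. The sketch below shows the difference on made-up data; the alcohol column and all values are assumptions, not taken from the question, and the evalml call is left as a comment because its behavior depends on the installed version.

```python
import pandas as pd

# Hypothetical stand-in for the asker's data; only `quality` comes from the question.
df = pd.DataFrame({
    "quality": [3, 5, 6, 6, 7, 8, 5, 6, 7, 4],
    "alcohol": [9.4, 9.8, 10.0, 9.5, 11.2, 12.0, 9.9, 10.5, 11.0, 9.2],
})

# Same three-way labelling as in the question.
df.loc[df.quality < 6, "flag_class"] = 1
df.loc[df.quality == 6, "flag_class"] = 2
df.loc[df.quality > 6, "flag_class"] = 3

X = df[["alcohol"]].copy()            # double brackets: 2-D DataFrame of features
y_frame = df[["flag_class"]].copy()   # double brackets: still a DataFrame (the problem case)
y = df["flag_class"].copy()           # single brackets: 1-D Series, suitable as a target

print(type(y_frame).__name__, y_frame.shape)
print(type(y).__name__, y.shape)

# With a Series target, the call from the question would then be (not run here):
# X_train, X_holdout, y_train, y_holdout = evalml.preprocessing.split_data(
#     X, y, problem_type="multiclass")
```

If the error persists with a Series target, it may be worth checking that the installed evalml and woodwork versions are compatible with each other.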