问题描述
from sklearn.feature_extraction.text import TfidfVectorizer
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(FinalList)
print(vectorizer.get_feature_names())
print(X.shape)
data["sentiment"]=data["sentiment"].map(dict(positive=1,negative=0))
data.head(5)
y= data["sentiment"]
x_train,x_test,y_train,y_test = train_test_split(X,y,test_size=0.2,random_state=2020)
clf = MultinomialNB()
clf.fit(x_train,y_train)
y_head_train = clf.predict(x_train)
y_head_test=clf.predict(x_test)
print(accuracy_score(y_head_train,y_train))
print(accuracy_score(y_head_test,y_test))
Newlist=Tokenization(post_df['text'])
Newlistlist = list(filter(None,Newlist))
vectorizer = TfidfVectorizer(vocabulary=vectorizer.vocabulary_)
NewText = vectorizer.fit_transform(Newlist)
print(vectorizer.get_feature_names())
print(NewText.shape)
y_NewText = clf.predict(NewText)
我遇到的问题与下面相同,我该如何解决尺寸不匹配错误..?谢谢。 scikit-learn ValueError: dimension mismatch
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)