问题描述
df = pd.DataFrame({'Body_Q': [['p','starting','work','hobby','project','python'],['p','read','several','documentation','already']],'Price': [['p','use','buildbot','hudson','jython','development'],'sure','level','kNowledge','apologize','answer']],'A': ['NaN','NaN'],'Q': ['NaN','NaN']
})
我的想法是循环每个句子,然后循环每个单词并从 Word2vec 中捕获 Words 向量。然后它想将句子中每个单词的每个向量存储在一个数组中。
我编写了以下代码:
# first loop
for index,row in df_train.iterrows():
df_train.iloc[index]['A'] = []
#second loop
for word in df_train.iloc[index]['Body_A']:
# We check if the word is in the vocab to get vectors
if word in model.vocab:
# if yes I want to catch them and put them inside my array
np.append(df_train.iloc[index]['A'],model[word])
df_train.iloc[index]['Q'] = []
for word in df_train.iloc[index]['Body_Q']:
if word in model.vocab:
np.append(df_train.iloc[index]['Q'],model[word])
print(df_train.iloc[index]['A'])
但出于某种原因,我无法理解我的输出总是充满 nan 值:
nan
nan
nan
nan
...
我不明白我会出什么问题。你能告诉我更多吗?
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)