如何在 R 中创建散点图以进行主题建模?

问题描述

对于 lda 主题建模,我想添加一个散点图,它映射出地图上主题间的关系。我知道 LDAvis 函数能够做到这一点,但如果可能,我想使用替代方法,因为我的实际数据集可能包含许多主题,这不允许 LDAvis申请。

在以下网站 https://towardsdatascience.com/visualizing-topic-models-with-scatterpies-and-t-sne-f21f228f7b02 上,我找到了该网站上的代码并尝试对其进行调整,但无法使其正常工作。我需要一个图来让我创建散点图或类似的东西来进行分析。

我试图用我的改编工作的代码

top_terms %>%
    group_by(topic) %>%
    ggplot() + 
    geom_scatterpie(aes(top_terms,beta,fill = factor(topic)),color=NA,alpha=0.7) + 
    coord_equal() + 
    geom_label() + 
    ggtitle(scatterpie_Graph) + 
    xlab() + ylab() + labs(subtitle=t-SNE_Representation_of_Guided_LDA_Topics_Colored_and_Sized_by_Topic_Probability) +
    scale_fill_manual(values=colors) + 
    theme_minimal() + 
    theme(text = element_text(color=white),legend.position = none,panel.background = element_rect(fill = gray17,colour = gray17),plot.background = element_rect(fill = gray17),panel.grid.major = element_line(colour = gray25),panel.grid.minor = element_line(colour = gray25),axis.text = element_text(color=white))

虚拟数据集顶级术语:

top_terms_struct <- structure(
  list(
    topic = c(1,1,2,3,4,4),term = c("book","page","chapter","section","sports","soccer","champions","league","music","song","dj","release","movie","cinema","actress","story"),beta = c(0.9876,0.9765,0.9654,0.9543,0.8765,0.8654,0.8543,0.8432,0.8678,0.7231,0.6382,0.9846,0.9647,0.8878,0.6523)),row.names = c(NA,-16L),class = c("tbl_df","tbl","data.frame")) 

所需的输出应该是散点图或类似的东西,它映射的主题LDAvis功能相似,但最好使用不同的技术。但我对其他选择持开放态度。

提前致谢。

解决方法

暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!

如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@)