How to use <br>

Problem description

I am trying to get the text of an element with RSelenium. I am scraping a homepage that contains a Leaflet map:

library(rvest)
library(RSelenium)

remDr <- remoteDriver(
  remoteServerAddr = "localhost", port = 4444, browserName = "chrome"
)

remDr$open()
remDr$getStatus()

homepage <- c("https://schulfinder.kultus-bw.de/?q=P3Rlcm09JmRpc3RhbmNlPSZ0eXBlcz0zJnNjaG9vbF9raW5kPTEmc2Nob29sX2tpbmQ9MiZzY2hvb2xfa2luZD0zJnNjaG9vbF9raW5kPTQmc2Nob29sX2tpbmQ9NSZzY2hvb2xfa2luZD02JnNjaG9vbF9raW5kPTcmd29ya19zY2hlZHVsZT0yJndvcmtfc2NoZWR1bGU9MSZvd25lcj0m")

remDr$navigate(homepage)
Sys.sleep(5)

I can get the text of the whole "p" block (it gives me the content of all 3 lines):

# Build the XPath of the second table row (tr[2]) and click it to open the popup
path <- paste0("/html/body/div/div/div[5]/table[1]/tbody/tr[", 1 + 1, "]")
webElem <- remDr$findElement(using = "xpath", path)
webElem$clickElement()
popup <- remDr$findElement(using = "xpath","/html/body/div/div/div[4]/div/div[1]/div[2]/div[4]/div")
element <- remDr$findElement(using = "xpath","/html/body/div/div/div[4]/div/div[1]/div[2]/div[4]/div/div[1]/div/p[1]")
> adress <- element$getElementText()
> adress
[[1]]
[1] "Justus-von-Liebig-Schule Aalen Hauswirtschaftliche und Landwirtschaftliche Schule\nHauptstelle\nÖffentliche Schule"

I know I can split the whole section ("p") into pieces:

adresslist <- strsplit(adress[[1]],"\n",fixed = FALSE)
> adresslist
[[1]]
[1] "Justus-von-Liebig-Schule Aalen Hauswirtschaftliche und Landwirtschaftliche Schule"
[2] "Hauptstelle"                                                                      
[3] "Öffentliche Schule"

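I would rather address each line on its own, but I am new to XPath and cannot find the right way. Selecting the br elements only returns empty strings: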

element <- remDr$findElement(using = "xpath","/html/body/div/div/div[4]/div/div[1]/div[2]/div[4]/div/div[1]/div/p[1]/br[1]")
> art <- element$getElementText()
> art
[[1]]
[1] ""

> element <- remDr$findElement(using = "xpath","/html/body/div/div/div[4]/div/div[1]/div[2]/div[4]/div/div[1]/div/p[1]/br")
> art <- element$getElementText()
> art
[[1]]
[1] ""
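
A possible explanation for the empty strings: br is an empty element, so it has no text of its own. The three lines are text nodes that are children of the p element, and Selenium's findElement can only return elements, not text nodes. One way around this is to parse the page source outside the browser and select the text nodes with text(). The following is a minimal sketch, assuming the xml2 package is installed (rvest depends on it). The page_source string is a made-up stand-in for the real popup markup, and the shortened XPath is only for this stand-in.

```r
library(xml2)

# Hypothetical stand-in for the popup markup. In a live session this string
# would come from remDr$getPageSource()[[1]] instead.
page_source <- paste0(
  "<html><body><div>",
  "<p>Justus-von-Liebig-Schule Aalen Hauswirtschaftliche und ",
  "Landwirtschaftliche Schule<br>Hauptstelle<br>Öffentliche Schule</p>",
  "</div></body></html>"
)

doc <- read_html(page_source)

# text() selects the text nodes that sit between the <br> elements
text_nodes <- xml_find_all(doc, "//div/p[1]/text()")

# One character string per line; drop whitespace-only nodes, if any
school_lines <- xml_text(text_nodes, trim = TRUE)
school_lines <- school_lines[nzchar(school_lines)]

school_name <- school_lines[1]
school_site <- school_lines[2]
school_type <- school_lines[3]
```

With the live session from the question, the same idea would be to pass remDr$getPageSource()[[1]] to read_html() and append /text() to the full XPath of p[1]. Whether that long absolute path stays valid on the real page is not something this sketch can confirm.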
