问题描述
由于元素的延迟加载,Python从可能的100个元素中仅选择了18个元素,在页面加载时,将仅加载18个元素,并且在滚动时将上传新元素,删除先前加载的元素,如何解决此问题它将所有100个元素存储在persons
列表中,然后单击其中的每个元素。
DOM element structure of the page
d = 0
itr = 1
time.sleep(17)
while True:
persons = browser.find_elements_by_xpath("//*[@class='i-edit mrs no-text-decoration ember-view']")
print(len(persons))
for i,person in zip(names,persons):
time.sleep(4)
persons = browser.find_elements_by_xpath("//*[@class='i-edit mrs no-text-decoration ember-view']")
if d >= len(persons):
break
i = names[d]
person = persons[d]
browser.execute_script("arguments[0].scrollIntoView(true);",person)
time.sleep(3)
person.click()
解决方法
要单击页面上的所有元素,向下滚动后将加载新元素,而旧元素将消失。
while True:
# Get screen height upon page loading for the first time
last_height = browser.execute_script("return document.body.scrollHeight")
#Get the list of elements
persons = browser.find_elements_by_xpath("//*[@class='i-edit mrs no-text-decoration ember-view']")
# Lopp through all elements and click on them
for i in range (1,len(persons)+1):
# As after every click page will be reloaded,getting fresh reference to element every time,so that wont get stale element exception
linkXpath = "(//*[@class='i-edit mrs no-text-decoration ember-view'])["+str(i)+"]"
WebDriverWait(browser,30).until(EC.element_to_be_clickable((By.XPATH,linkXpath ))).click()
# Scroll down to bottom
browser.execute_script("window.scrollTo(0,document.body.scrollHeight);")
time.sleep(2)
#Get new height of screen
new_height = browser.execute_script("return document.body.scrollHeight")
if new_height == last_height:
break
last_height = new_height