How to Avoid StaleElementException When Scraping Amazon Search Results?

Barbara Streisand
Release: 2024-11-22 00:56:14
Original
966 people have browsed it

How to Avoid StaleElementException When Scraping Amazon Search Results?

StaleElementException in Selenium Iterations

When attempting to iterate through search results on Amazon using Selenium, users may encounter a StaleElementException when repeatedly scrolling down to load new pages. This error occurs because the element used for scrolling, bottom_bar, becomes invalid after the page reloads.

To resolve this issue and enable more reliable pagination, it is recommended to adopt a simpler approach that eliminates explicit page scrolling. Instead, Selenium can continuously click the "Next" button until it becomes disabled. This simplifies the code and ensures that the driver can consistently navigate through the results.

The updated code below implements this approach:

from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.support.ui import WebDriverWait as wait
from selenium.common.exceptions import TimeoutException

driver = webdriver.Chrome()

driver.get('https://www.amazon.com/s/ref=nb_sb_noss_1?url=search-alias%3Daps&field-keywords=sonicare+toothbrush')

while True:
    try:
        wait(driver, 10).until(EC.element_to_be_clickable((By.CSS_SELECTOR, 'a > span#pagnNextString'))).click()
    except TimeoutException:
        break
Copy after login

Note that implicitly_wait(10) does not wait for a full 10 seconds but rather "waits up to 10 seconds for an element to appear in the HTML DOM." Therefore, if the element is found within a shorter duration (e.g., 1 or 2 seconds), the waiting process is completed.

The above is the detailed content of How to Avoid StaleElementException When Scraping Amazon Search Results?. For more information, please follow other related articles on the PHP Chinese website!

source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Latest Articles by Author
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template