繁体   English   中英

如何从Google趋势中提取标题/文本并通过Selenium和Python进行打印

[英]How to extract titles/text from Google Trends and print them through Selenium and Python

我想从这个网站中提取每行中的不同标题:

https://trends.google.com/trends/trendingsearches/realtime?geo=AR&category=all

我尝试了一些没有运气的尝试。 我认为通过按类搜索元素我会得到所需的文本:

from selenium import webdriver
driver=webdriver.Chrome('path to bin')
driver.get('https://trends.google.com/trends/trendingsearches/realtime?geo=AR&category=all')
hrefs = driver.find_elements_by_class_name('title')
print hrefs
print(len(hrefs))
driver.quit()

先谢谢你们!

你真是太近了! 你只需要从标题中获取文本,试试这个:

from selenium import webdriver

driver=webdriver.Chrome('path to bin')
driver.get('https://trends.google.com/trends/trendingsearches/realtime?geo=AR&category=all')
Titles = driver.find_elements_by_class_name('title')
for title in Titles:
    print(title.text)
driver.quit()

@ PixelEinstein的答案将满足您的要求。 但作为最佳实践的一部分,您应始终最大化 浏览器窗口并诱导WebDriverWait 首先显示元素,然后按如下方式提取文本:

  • 代码块:

     from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.support.ui import WebDriverWait from selenium.webdriver.support import expected_conditions as EC options = webdriver.ChromeOptions() options.add_argument("start-maximized") options.add_argument('disable-infobars') driver=webdriver.Chrome(chrome_options=options, executable_path=r'C:\\Utility\\BrowserDrivers\\chromedriver.exe') driver.get('https://trends.google.com/trends/trendingsearches/realtime?geo=AR&category=all') titles = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.XPATH, "//div[@class='title']"))) for title in titles: print(title.text) driver.quit() 
  • 控制台输出:

     Mauricio Macri • Cyst • Pancreas Abortion • National Congress of Argentina • Debate Abortion • Mayra Mendoza • Argentine Chamber of Deputies • Deputy 

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM