谷歌文档importXML TripAdvisor

Question

ImportXML : I am trying to scrape http://www.tripadvisor.in/Hotels-g297615-c2-Gurgaon_Haryana-Hotels.html . ImportXML：我正在尝试抓取http://www.tripadvisor.in/Hotels-g297615-c2-Gurgaon_Haryana-Hotels.html 。 I need the following fields - Name of the hotel, URL, Address, City, Pincode, no of reviews(just number) for each hotel, percentage and amenities. 我需要以下字段-酒店名称，URL，地址，城市，密码，每个酒店的评论数（仅数），百分比和便利设施。 I want this all in a single row for a single hotel. 我希望所有这一切都在单个酒店的一行中。 Can anyone help me out. 谁能帮我吗。

Google Doc Link: https://docs.google.com/spreadsheets/d/1D6X9c9uX7AltxWQ3ln0Pqqzq_CIroCkDxPYr6lv-47k/edit#gid=1666841843 Google文档链接： https : //docs.google.com/spreadsheets/d/1D6X9c9uX7AltxWQ3ln0Pqqzq_CIroCkDxPYr6lv-47k/edit#gid=1666841843

I am unable to get all the above requirements in GoogleDoc. 我无法在GoogleDoc中获得所有上述要求。 I am stuck in the address scraping just beside the URL. 我被困在URL旁边的地址抓取中。

Answer 1

Responding to your comment, you can limit data returned by the xpath to one using index ( [1] ), for example (formatted for readability): 响应您的评论，您可以使用索引（ [1] ）将xpath返回的数据限制为一个（例如，出于可读性考虑而格式化）：

=IMPORTXML(concatenate("http://www.tripadvisor.in",D2),
            "(//span[@class='street-address'])[1]")

谷歌文档importXML TripAdvisor

问题描述

1 个解决方案

解决方案1
0 2014-08-09 19:47:17

谷歌文档importXML TripAdvisor

问题描述

1 个解决方案

解决方案1 0 2014-08-09 19:47:17

解决方案1
0 2014-08-09 19:47:17