Google Sheets ImportXML fails

Question

This one works:

=importxml("https://discgolfmetrix.com/?u=scorecard&ID=900113&view=result", "//table[@class='data data-hover']/tr/td[2]")

This one fails:

=importxml("https://discgolfmetrix.com/?u=scorecard&ID=1172639&view=result", "//table[@class='data data-hover']/tr/td[2]")

If it were the other way around I could understand it, since the first one has 2 tbody tags.

Answer 1

Google Sheets parses the page in its own way (the parent > child structure is not exactly the same as in your browser). Use //tr in your XPath to circumvent parsing errors:

=IMPORTXML("https://discgolfmetrix.com/?u=scorecard&ID=1172639&view=result","//table[@class='data data-hover']//tr/td[2]")

Or use IMPORTHTML and QUERY:

=QUERY(IMPORTHTML("https://discgolfmetrix.com/?u=scorecard&ID=1172639&view=result","table",1),"select Col2 OFFSET 1")
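To make the QUERY clause concrete: "select Col2 OFFSET 1" keeps the second column and skips the first returned row (presumably the header row here). A rough Python analogue, using made-up rows rather than the real scorecard data:

```python
# Hypothetical rows standing in for what IMPORTHTML returns (not real data).
rows = [
    ["#", "Name", "Total"],
    ["1", "Alice", "54"],
    ["2", "Bob", "57"],
]


def select_col2_offset1(table):
    """Mimic QUERY's "select Col2 OFFSET 1": second column, minus the first row."""
    second_column = [row[1] for row in table]  # Col2 is the second column
    return second_column[1:]                   # OFFSET 1 drops the first row


print(select_col2_offset1(rows))
```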


EDIT: More details:

For the first link, the parsed HTML structure is the following:

<table>
    <tr>    
        <td></td>
        <td>your_data</td>
        ...
    </tr>
    <tr>    
        <td></td>
        <td>your_data</td>
        ...
    </tr>
    ...
</table>

And your XPath works.

For the second link, the tr elements are wrapped in a tbody element, so they are no longer direct children of the table. The structure is:

<table>
    <tbody>     
        <tr>    
            <td></td>
            <td>your_data</td>
            ...
        </tr>
        <tr>    
            <td></td>
            <td>your_data</td>
            ...
        </tr>
        ...
    </tbody>
</table>

And your XPath fails, because /tr only matches tr elements that are direct children of the table. That's why you have to use // or declare the tbody element in your expression:

=IMPORTXML("https://discgolfmetrix.com/?u=scorecard&ID=1172639&view=result","//table[@class='data data-hover']/tbody/tr/td[2]")
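The difference between the three expressions can be reproduced outside Google Sheets. The sketch below uses Python's standard xml.etree.ElementTree on two tiny hand-written documents that mirror the structures above; the cell values and the helper name second_cells are made up for illustration, and ElementTree's limited XPath support requires a leading "." on each path. It does not reproduce how Google Sheets itself parses the page.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature pages mirroring the two parsed structures.
WITHOUT_TBODY = """
<html><body>
<table class="data data-hover">
  <tr><td>1</td><td>Alice</td></tr>
  <tr><td>2</td><td>Bob</td></tr>
</table>
</body></html>
""".strip()

WITH_TBODY = """
<html><body>
<table class="data data-hover">
  <tbody>
    <tr><td>1</td><td>Alice</td></tr>
    <tr><td>2</td><td>Bob</td></tr>
  </tbody>
</table>
</body></html>
""".strip()

# The three path variants discussed in the answer.
CHILD = ".//table[@class='data data-hover']/tr/td[2]"           # tr must be a direct child
DESCENDANT = ".//table[@class='data data-hover']//tr/td[2]"     # tr at any depth
EXPLICIT = ".//table[@class='data data-hover']/tbody/tr/td[2]"  # tbody spelled out


def second_cells(markup, path):
    """Return the text of every element matched by path in markup."""
    root = ET.fromstring(markup)
    return [td.text for td in root.findall(path)]


for name, doc in (("without tbody", WITHOUT_TBODY), ("with tbody", WITH_TBODY)):
    for label, path in (("/tr", CHILD), ("//tr", DESCENDANT), ("/tbody/tr", EXPLICIT)):
        print(name, label, second_cells(doc, path))
```

Only the //tr variant matches both structures, which is why it is the safer choice when you cannot be sure how the page will be parsed.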

Answer 1: accepted, 2020-07-13 15:38:02
