繁体   English   中英

如何从html节点获取属性值<a>?</a>

[英]How to get an attribute value from html node <a>?

我正在尝试抓取以下网站,但没有获得“数据链接”属性的值。

http://www.apptrace.com/itunes/charts/FRA/topfreeapplications/36/2018-12-27

有人可以帮我吗?

//尝试#1(错误)

const puppeteer = require('puppeteer')

let scrape = async () => {
    const browser = await puppeteer.launch({headless: true})
    const page = await browser.newPage()
    await page.goto('http://www.apptrace.com/itunes/charts/USA/topfreeapplications/36')
    await page.waitFor(1000)

    const countryCharts = await page.evaluate(() => {

    const abbrAppsCountry = []
    document.getElementById('#current_storefront_list')
        .getAttribute('li > a[data-link]')
        .forEach(app => abbrAppsCountry.push(app.value))

        return abbrAppsCountry

    })

browser.close()
return countryCharts
}

scrape().then((value) => {
    console.log(value)
})

//尝试#2(空数组)

const puppeteer = require('puppeteer')

let scrape = async () => {
    const browser = await puppeteer.launch({headless: true})
    const page = await browser.newPage()
    await page.goto('http://www.apptrace.com/itunes/charts/USA/topfreeapplications/36')
    await page.waitFor(1000)

    const countryCharts = await page.evaluate(() => {

    const abbrAppsCountry = []
    document.querySelectorAll('#current_storefront_list > li > a[data-link]')
        .forEach(app => abbrAppsCountry.push(app.value))

        return abbrAppsCountry

    })

browser.close()
return countryCharts
}

scrape().then((value) => {
console.log(value)
})

我想得到国家名称的缩写。

您可以使用datasetgetAttribute API:

document.querySelectorAll('#current_storefront_list > li > a')
        .forEach(app => abbrAppsCountry.push(app.dataset.link))

要么:

document.querySelectorAll('#current_storefront_list > li > a')
        .forEach(app => abbrAppsCountry.push(app.getAttribute('data-link')))

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM