JS Puppeteer - How to scrape a table

I am not too knowledgeable about JavaScript or Puppeteer, and I was wondering how to scrape the following table:

[Image: the table I want to scrape]

I want to scrape everything within the table body, which consists of <tr>...</tr> rows containing <td class="className">...</td> cells.

Any solutions?

let text = await page.evaluate(() => document.querySelector('tbody').innerText)
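
Note that innerText returns the whole table body as a single string. In Chrome, cells within a row are normally separated by tabs and rows by newlines, so one way to turn that string back into rows might be the sketch below. parseTableText is a hypothetical helper name, not part of Puppeteer, and it assumes that no cell contains a tab or line break of its own.

```javascript
// Hypothetical helper: split the innerText of a <tbody> into rows and cells.
// Assumes tabs separate cells and newlines separate rows, and that no cell
// contains a tab or line break of its own.
function parseTableText(text) {
  return text
    .split('\n')                         // one line per table row
    .filter(line => line.length > 0)     // drop empty lines, e.g. a trailing newline
    .map(line => line.split('\t').map(cell => cell.trim()));
}
```

It would be called in Node after the evaluate call, for example as parseTableText(text). If cells can contain line breaks, querying the <td> elements directly is likely to be more reliable.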

That might look something like:

// Collect the text of every matching cell, grouped by table row
const rows = await page.evaluate(() => {
  return [...document.querySelectorAll('.tbrdr tbody tr')].map(tr => {
    return [...tr.querySelectorAll('td.className')].map(td => td.innerText)
  })
})
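
The evaluate call above resolves to an array of rows, each an array of cell strings. If named fields are easier to work with, a small post-processing step could map each row to an object. This is only a sketch: rowsToObjects and the column names in the example are made up for illustration, since the real headers are not shown in the question.

```javascript
// Hypothetical helper: turn rows of cell text into objects keyed by column name.
// `headers` is an assumed list of column names, one per <td> in a row.
function rowsToObjects(rows, headers) {
  return rows
    .filter(cells => cells.length > 0) // skip rows with no matching <td>
    .map(cells =>
      // Missing cells fall back to an empty string
      Object.fromEntries(headers.map((name, i) => [name, cells[i] ?? '']))
    );
}

// Example with made-up data in the same shape as the scraped rows.
const sample = [[], ['Alice', '42'], ['Bob', '17']];
console.log(rowsToObjects(sample, ['name', 'score']));
```

Rows without any td.className cells come back as empty arrays, so the filter drops them rather than emitting empty objects.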
