XIDEL将多个HTML文件中的提取数据（div）导出到新的HTML文件中

Question

I would like to scrap a website of multi pages and extract a particular div before exporting it to html pages with just the div as content. 我想抓取一个包含多个页面的网站并提取特定的div，然后再将其导出到仅包含div作为内容的html页面。

I am able to extract data from the content using Xidel with the following command 我可以使用Xidel通过以下命令从内容中提取数据

xidel http://someURl/ --extract //div[2]/div[2]/div -f "//a" -e //div[2]/div[2]/div

Is it possible to download the extracted data into a html file? 是否可以将提取的数据下载到html文件中？

Answer 1

添加参数：--output-format = html

XIDEL将多个HTML文件中的提取数据（div）导出到新的HTML文件中

问题描述

1 个解决方案

解决方案1
1 2016-12-19 17:11:54

XIDEL将多个HTML文件中的提取数据（div）导出到新的HTML文件中

问题描述

1 个解决方案

解决方案1 1 2016-12-19 17:11:54

解决方案1
1 2016-12-19 17:11:54