JTidy Node.findBody（） - 如何使用？

Question

I'm trying to do XHTML DOM parsing with JTidy, and it seems to be rather counterintuitive task. 我正在尝试用JTidy进行XHTML DOM解析，这似乎是违反直觉的任务。 In particular, there's a method to parse HTML: 特别是，有一种解析HTML的方法：

Node Tidy.parse(Reader, Writer)

And to get the <body /> of that Node, I assume, I should use 为了获得该节点的<body />，我认为，我应该使用

Node Node.findBody(TagTable)

Where should I get an instance of that TagTable? 我应该在哪里获得该TagTable的实例？ (Constructor is protected, and I haven't found a factory to produce it.) （构造函数受到保护，我还没有找到工厂来生产它。）

I use JTidy 8.0-SNAPSHOT. 我使用JTidy 8.0-SNAPSHOT。

Answer 1

I found there's much simpler method to extract the body: 我发现有更简单的方法来提取身体：

tidy = new Tidy();
tidy.setXHTML(true);
tidy.setPrintBodyOnly(true);

And then use tidy on the Reader-Writer pair. 然后在Reader-Writer对上使用整洁。

Simple as it should be. 应该是简单的。

Answer 2

You could use the parseDOM method instead, which would give you a org.w3c.dom.Document back: 您可以使用parseDOM方法，这将为您提供org.w3c.dom.Document ：

Document document = Tidy.parseDOM(reader, writer);
Node body = document.getElementsByTagName("body").item(0);

JTidy Node.findBody（） - 如何使用？

问题描述

2 个解决方案

解决方案1
6 已采纳 2008-10-21 10:30:38

解决方案2
3 2008-10-21 09:47:27

JTidy Node.findBody（） - 如何使用？

问题描述

2 个解决方案

解决方案1 6 已采纳 2008-10-21 10:30:38

解决方案2 3 2008-10-21 09:47:27

解决方案1
6 已采纳 2008-10-21 10:30:38

解决方案2
3 2008-10-21 09:47:27