简体繁体 English

jTidy和TagSoup文档

[英]jTidy and TagSoup documentation

原文 2010-12-15 16:49:58 6 1 java/ jtidy/ tag-soup/ jericho-html-parser

I'm looking for documentation (officially documentation if it is possible) for TagSoup and jTidy libraries. 我正在寻找TagSoup和jTidy库的文档（如果可能的话，正式文档）。

I want use this libraries to manipulate html "tagsoup" files that include xml tags with different namespaces mixed between html (html, xhtml or html5) tags. 我想使用这个库来操作html“tagsoup”文件，其中包含混合在html（html，xhtml或html5）标记之间的不同命名空间的xml标记。

I have tested HTMLCleaner, NekoHTML and Jericho, but i don't find documentation for jTidy and TagSoup, apart from simplest examples to clear a file. 我测试了HTMLCleaner，NekoHTML和Jericho，但我找不到jTidy和TagSoup的文档，除了清除文件的最简单的例子。

I need documentation about manipulate contents, replace tags, extract info, etc... 我需要有关操纵内容，替换标签，提取信息等的文档......

Thanks 谢谢

Note: After test all options, I used StAX / Woodstox : 注意：测试完所有选项后，我使用了StAX / Woodstox ：

1 个解决方案

The answer to a similar question on the tagsoup-friends google group may help: 关于tagsoup-friends google群组的类似问题的答案可能会有所帮助：

Documentation for TagSoup TagSoup的文档

You've probably already seen them, but the javadoc for JTidy is available here: http://jtidy.sourceforge.net/apidocs/index.html 您可能已经看过它们了，但JTidy的javadoc可以在这里找到： http ： //jtidy.sourceforge.net/apidocs/index.html