简体   繁体   English

jTidy和TagSoup文档

[英]jTidy and TagSoup documentation

I'm looking for documentation (officially documentation if it is possible) for TagSoup and jTidy libraries. 我正在寻找TagSoup和jTidy库的文档(如果可能的话,正式文档)。

I want use this libraries to manipulate html "tagsoup" files that include xml tags with different namespaces mixed between html (html, xhtml or html5) tags. 我想使用这个库来操作html“tagsoup”文件,其中包含混合在html(html,xhtml或html5)标记之间的不同命名空间的xml标记。

I have tested HTMLCleaner, NekoHTML and Jericho, but i don't find documentation for jTidy and TagSoup, apart from simplest examples to clear a file. 我测试了HTMLCleaner,NekoHTML和Jericho,但我找不到jTidy和TagSoup的文档,除了清除文件的最简单的例子。

I need documentation about manipulate contents, replace tags, extract info, etc... 我需要有关操纵内容,替换标签,提取信息等的文档......

Thanks 谢谢

Note: After test all options, I used StAX / Woodstox : 注意:测试所有选项后,我使用了StAX / Woodstox

The answer to a similar question on the tagsoup-friends google group may help: 关于tagsoup-friends google群组的类似问题的答案可能会有所帮助:

Documentation for TagSoup TagSoup的文档

You've probably already seen them, but the javadoc for JTidy is available here: http://jtidy.sourceforge.net/apidocs/index.html 您可能已经看过它们了,但JTidy的javadoc可以在这里找到: http//jtidy.sourceforge.net/apidocs/index.html

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM