htmlcleaner惯用法
Common usage
Tipically the following steps are taken:
// create an instance of HtmlCleanerHtmlCleaner cleaner = new HtmlCleaner();?// take default cleaner propertiesCleanerProperties props = cleaner.getProperties();?// customize cleaner's behaviour with property settersprops.setXXX(...);?// Clean HTML taken from simple string, file, URL, input stream, // input source or reader. Result is root node of created // tree-like structure. Single cleaner instance may be safely used// multiple times.TagNode node = cleaner.clean(...);?// optionally find parts of the DOM or modify some nodesTagNode[] myNodes = node.getElementsByXXX(...);// and/orObject[] myNodes = node.evaluateXPath(xPathExpression);// and/oraNode.removeFromTree();// and/oraNode.addAttribute(attName, attValue);// and/oraNode.removeAttribute(attName, attValue);// and/orcleaner.setInnerHtml(aNode, htmlContent);?// serialize a node to a file, output stream, DOM, JDom...new XXXSerializer(props).writeXmlXXX(aNode, ...);myJDom = new JDomSerializer(props, true).createJDom(aNode);myDom = new DomSerializer(props, true).createDOM(aNode);