public abstract class HtmlTokenizer extends Object
It's task is to parse HTML and produce list of valid tokens: open tag tokens, end tag tokens, contents (text) and comments. As soon as new item is added to token list, cleaner is invoked to clean current list at the end.
| Constructor and Description |
|---|
HtmlTokenizer(Reader reader,
CleanerProperties props,
CleanerTransformations transformations,
ITagInfoProvider tagInfoProvider)
Constructor - cretes instance of the parser with specified content.
|
| Modifier and Type | Method and Description |
|---|---|
DoctypeToken |
getDocType() |
public HtmlTokenizer(Reader reader, CleanerProperties props, CleanerTransformations transformations, ITagInfoProvider tagInfoProvider) throws IOException
reader - props - transformations - tagInfoProvider - IOExceptionpublic DoctypeToken getDocType()
Copyright © 2006–2017. All rights reserved.