java.lang.Object
  org.apache.xerces.util.ParserConfigurationSettings
    org.cyberneko.html.HTMLConfiguration
An XNI-based parser configuration for parsing HTML documents. This configuration can be used directly to parse HTML documents, or in conjunction with any XNI-based tools, such as the Xerces2 implementation.
This configuration recognizes the following features:
This configuration recognizes the following properties:
For complete usage information, refer to the documentation.
See Also:
HTMLScanner, HTMLTagBalancer, HTMLErrorReporter
| Nested Class Summary | |
| protected class | HTMLConfiguration.ErrorReporter - Defines an error reporter for reporting HTML errors. |
| Field Summary | |
| protected static java.lang.String | AUGMENTATIONS - Include infoset augmentations. |
| protected static java.lang.String | BALANCE_TAGS - Balance tags. |
| protected static java.lang.String | ERROR_DOMAIN - Error domain. |
| protected static java.lang.String | ERROR_REPORTER - Error reporter. |
| protected boolean | fCloseStream - Stream opened by parser. |
| protected org.apache.xerces.xni.XMLDocumentHandler | fDocumentHandler - Document handler. |
| protected HTMLScanner | fDocumentScanner - Document scanner. |
| protected org.apache.xerces.xni.XMLDTDContentModelHandler | fDTDContentModelHandler - DTD content model handler. |
| protected org.apache.xerces.xni.XMLDTDHandler | fDTDHandler - DTD handler. |
| protected org.apache.xerces.xni.parser.XMLEntityResolver | fEntityResolver - Entity resolver. |
| protected org.apache.xerces.xni.parser.XMLErrorHandler | fErrorHandler - Error handler. |
| protected HTMLErrorReporter | fErrorReporter - Error reporter. |
| protected java.util.Vector | fHTMLComponents - Components. |
| protected static java.lang.String | FILTERS - Pipeline filters. |
| protected java.util.Locale | fLocale - Locale. |
| protected NamespaceBinder | fNamespaceBinder - Namespace binder. |
| protected HTMLTagBalancer | fTagBalancer - HTML tag balancer. |
| protected static java.lang.String | NAMES_ATTRS - Modify HTML attribute names: { "upper", "lower", "default" }. |
| protected static java.lang.String | NAMES_ELEMS - Modify HTML element names: { "upper", "lower", "default" }. |
| protected static java.lang.String | NAMESPACES - Namespaces. |
| protected static java.lang.String | REPORT_ERRORS - Report errors. |
| protected static java.lang.String | SIMPLE_ERROR_FORMAT - Simple report format. |
| protected static boolean | XERCES_2_0_0 - Parser version is Xerces 2.0.0. |
| protected static boolean | XERCES_2_0_1 - Parser version is Xerces 2.0.1. |
| protected static boolean | XML4J_4_0_x - Parser version is XML4J 4.0.x. |
| Fields inherited from class org.apache.xerces.util.ParserConfigurationSettings |
fFeatures, fParentSettings, fProperties, fRecognizedFeatures, fRecognizedProperties, PARSER_SETTINGS |
| Constructor Summary |
| HTMLConfiguration() - Default constructor. |
| Method Summary | |
| protected void | addComponent(HTMLComponent component) - Adds a component. |
| void | cleanup() - If the application decides to terminate parsing before the XML document is fully parsed, it should call this method to free any resources allocated during parsing. |
| org.apache.xerces.xni.XMLDocumentHandler | getDocumentHandler() - Returns the document handler. |
| org.apache.xerces.xni.XMLDTDContentModelHandler | getDTDContentModelHandler() - Returns the DTD content model handler. |
| org.apache.xerces.xni.XMLDTDHandler | getDTDHandler() - Returns the DTD handler. |
| org.apache.xerces.xni.parser.XMLEntityResolver | getEntityResolver() - Returns the entity resolver. |
| org.apache.xerces.xni.parser.XMLErrorHandler | getErrorHandler() - Returns the error handler. |
| java.util.Locale | getLocale() - Returns the locale. |
| boolean | parse(boolean complete) - Parses the document in a pull parsing fashion. |
| void | parse(org.apache.xerces.xni.parser.XMLInputSource source) - Parses a document. |
| void | pushInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource) - Pushes an input source onto the current entity stack. |
| protected void | reset() - Resets the parser configuration. |
| void | setDocumentHandler(org.apache.xerces.xni.XMLDocumentHandler handler) - Sets the document handler. |
| void | setDTDContentModelHandler(org.apache.xerces.xni.XMLDTDContentModelHandler handler) - Sets the DTD content model handler. |
| void | setDTDHandler(org.apache.xerces.xni.XMLDTDHandler handler) - Sets the DTD handler. |
| void | setEntityResolver(org.apache.xerces.xni.parser.XMLEntityResolver resolver) - Sets the entity resolver. |
| void | setErrorHandler(org.apache.xerces.xni.parser.XMLErrorHandler handler) - Sets the error handler. |
| void | setFeature(java.lang.String featureId, boolean state) - Sets a feature. |
| void | setInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource) - Sets the input source for the document to parse. |
| void | setLocale(java.util.Locale locale) - Sets the locale. |
| void | setProperty(java.lang.String propertyId, java.lang.Object value) - Sets a property. |
| Methods inherited from class org.apache.xerces.util.ParserConfigurationSettings |
addRecognizedFeatures, addRecognizedProperties, checkFeature, checkProperty, getFeature, getProperty |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Methods inherited from interface org.apache.xerces.xni.parser.XMLParserConfiguration |
addRecognizedFeatures, addRecognizedProperties, getFeature, getProperty |
| Field Detail |
protected static final java.lang.String NAMESPACES
protected static final java.lang.String AUGMENTATIONS
protected static final java.lang.String REPORT_ERRORS
protected static final java.lang.String SIMPLE_ERROR_FORMAT
protected static final java.lang.String BALANCE_TAGS
protected static final java.lang.String NAMES_ELEMS
protected static final java.lang.String NAMES_ATTRS
protected static final java.lang.String FILTERS
protected static final java.lang.String ERROR_REPORTER
protected static final java.lang.String ERROR_DOMAIN
protected org.apache.xerces.xni.XMLDocumentHandler fDocumentHandler
protected org.apache.xerces.xni.XMLDTDHandler fDTDHandler
protected org.apache.xerces.xni.XMLDTDContentModelHandler fDTDContentModelHandler
protected org.apache.xerces.xni.parser.XMLErrorHandler fErrorHandler
protected org.apache.xerces.xni.parser.XMLEntityResolver fEntityResolver
protected java.util.Locale fLocale
protected boolean fCloseStream
protected java.util.Vector fHTMLComponents
protected HTMLScanner fDocumentScanner
protected HTMLTagBalancer fTagBalancer
protected NamespaceBinder fNamespaceBinder
protected HTMLErrorReporter fErrorReporter
protected static boolean XERCES_2_0_0
protected static boolean XERCES_2_0_1
protected static boolean XML4J_4_0_x
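The NAMES_ELEMS and NAMES_ATTRS properties accept the values "upper", "lower", and "default". The following is a minimal, library-independent sketch of what such a name-modification setting could do. The class NameCaseSketch and the method modifyName are hypothetical names introduced for illustration, and the reading of "default" as "leave the name as written" is an assumption, not something this page states.

```java
import java.util.Locale;

public class NameCaseSketch {

    /**
     * Applies one of the documented modes ("upper", "lower", "default")
     * to an element or attribute name.
     */
    public static String modifyName(String name, String mode) {
        switch (mode) {
            case "upper":
                // A fixed locale avoids surprises such as the Turkish dotless i.
                return name.toUpperCase(Locale.ENGLISH);
            case "lower":
                return name.toLowerCase(Locale.ENGLISH);
            case "default":
                // Assumption: "default" leaves the name exactly as written.
                return name;
            default:
                throw new IllegalArgumentException("Unknown mode: " + mode);
        }
    }

    public static void main(String[] args) {
        System.out.println(modifyName("Body", "upper"));
        System.out.println(modifyName("HREF", "lower"));
        System.out.println(modifyName("Title", "default"));
    }
}
```

With the real configuration, the mode string would be passed as the value argument of setProperty together with the property identifier held in the corresponding constant.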
| Constructor Detail |
public HTMLConfiguration()
| Method Detail |
public void pushInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource)
Hint: To use this feature to insert the output of <SCRIPT> tags, remember to buffer the entire output of the processed instructions before pushing a new input source. Otherwise, events may appear out of sequence.
Parameters:
inputSource - The new input source to start scanning.
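The idea of an entity stack, and the hint about buffering, can be sketched without the parser itself. In the toy model below, a scanner always reads from the most recently pushed source and resumes the previous one when that source is exhausted. EntityStackSketch, its methods, and the '@' marker that stands in for a script are all hypothetical; the script output is handed over as one fully buffered string, as the hint recommends.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayDeque;
import java.util.Deque;

public class EntityStackSketch {

    // The most recently pushed source sits at the head of the deque.
    private final Deque<Reader> stack = new ArrayDeque<>();

    /** Pushes a source; later reads come from it until it is exhausted. */
    public void push(String content) {
        stack.push(new StringReader(content));
    }

    /** Returns the next character, or -1 once every source is exhausted. */
    public int read() {
        try {
            while (!stack.isEmpty()) {
                int c = stack.peek().read();
                if (c != -1) {
                    return c;
                }
                stack.pop(); // current source finished: resume the previous one
            }
            return -1;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Scans a document, pushing the buffered script output at each '@' marker. */
    public static String scan(String document, String scriptOutput) {
        EntityStackSketch entities = new EntityStackSketch();
        entities.push(document);
        StringBuilder events = new StringBuilder();
        int c;
        while ((c = entities.read()) != -1) {
            if (c == '@') {
                // The whole output is already buffered, so it is pushed in one piece.
                entities.push(scriptOutput);
            } else {
                events.append((char) c);
            }
        }
        return events.toString();
    }

    public static void main(String[] args) {
        System.out.println(scan("<p>@</p>", "<b>hi</b>"));
    }
}
```

Because the pushed content is complete before scanning continues, the characters of the script output appear contiguously between the surrounding markup instead of being interleaved with it.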
public void setFeature(java.lang.String featureId,
boolean state)
throws org.apache.xerces.xni.parser.XMLConfigurationException
Specified by:
setFeature in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException
public void setProperty(java.lang.String propertyId,
java.lang.Object value)
throws org.apache.xerces.xni.parser.XMLConfigurationException
Specified by:
setProperty in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException
public void setDocumentHandler(org.apache.xerces.xni.XMLDocumentHandler handler)
Specified by:
setDocumentHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDocumentHandler getDocumentHandler()
Specified by:
getDocumentHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setDTDHandler(org.apache.xerces.xni.XMLDTDHandler handler)
Specified by:
setDTDHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDTDHandler getDTDHandler()
Specified by:
getDTDHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setDTDContentModelHandler(org.apache.xerces.xni.XMLDTDContentModelHandler handler)
Specified by:
setDTDContentModelHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDTDContentModelHandler getDTDContentModelHandler()
Specified by:
getDTDContentModelHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setErrorHandler(org.apache.xerces.xni.parser.XMLErrorHandler handler)
Specified by:
setErrorHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.parser.XMLErrorHandler getErrorHandler()
Specified by:
getErrorHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setEntityResolver(org.apache.xerces.xni.parser.XMLEntityResolver resolver)
Specified by:
setEntityResolver in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.parser.XMLEntityResolver getEntityResolver()
Specified by:
getEntityResolver in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setLocale(java.util.Locale locale)
Specified by:
setLocale in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public java.util.Locale getLocale()
Specified by:
getLocale in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void parse(org.apache.xerces.xni.parser.XMLInputSource source)
throws org.apache.xerces.xni.XNIException,
java.io.IOException
Specified by:
parse in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Throws:
org.apache.xerces.xni.XNIException
java.io.IOException
public void setInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource)
throws org.apache.xerces.xni.parser.XMLConfigurationException,
java.io.IOException
Specified by:
setInputSource in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
Parameters:
inputSource - The document's input source.
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException - Thrown if there is a configuration error when initializing the parser.
java.io.IOException - Thrown on I/O error.
See Also:
parse(boolean)
public boolean parse(boolean complete)
throws org.apache.xerces.xni.XNIException,
java.io.IOException
Specified by:
parse in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
Parameters:
complete - True if the pull parser should parse the remaining document completely.
Throws:
org.apache.xerces.xni.XNIException - Any XNI exception, possibly wrapping another exception.
java.io.IOException - An IO exception from the parser, possibly from a byte stream or character stream supplied by the parser.
See Also:
setInputSource(org.apache.xerces.xni.parser.XMLInputSource)
public void cleanup()
Specified by:
cleanup in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
protected void addComponent(HTMLComponent component)
protected void reset()
throws org.apache.xerces.xni.parser.XMLConfigurationException
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException
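The pull-parsing methods setInputSource, parse(boolean), and cleanup are meant to be used together. The sketch below shows the calling pattern with a hypothetical stand-in interface, PullConfiguration, that mirrors those three methods but takes a plain string instead of an XMLInputSource. It assumes, as the Xerces XMLPullParserConfiguration documentation describes, that parse(boolean) returns true while more of the document remains. TokenConfiguration is a toy implementation, not the HTML scanner.

```java
import java.util.ArrayList;
import java.util.List;

public class PullParseSketch {

    /** Hypothetical stand-in for the three pull-parsing methods. */
    interface PullConfiguration {
        void setInputSource(String source);
        boolean parse(boolean complete); // assumed: true while more input remains
        void cleanup();
    }

    /** Toy implementation: each parse(false) call consumes one token. */
    static class TokenConfiguration implements PullConfiguration {
        final List<String> events = new ArrayList<>();
        private String[] tokens = new String[0];
        private int next;

        public void setInputSource(String source) {
            tokens = source.trim().split("\\s+");
            next = 0;
        }

        public boolean parse(boolean complete) {
            do {
                if (next < tokens.length) {
                    events.add(tokens[next++]);
                }
            } while (complete && next < tokens.length);
            return next < tokens.length;
        }

        public void cleanup() {
            next = tokens.length; // discard whatever was not parsed
            events.add("cleanup");
        }
    }

    /** Parses step by step and stops after at most maxSteps steps. */
    public static List<String> run(String source, int maxSteps) {
        TokenConfiguration config = new TokenConfiguration();
        config.setInputSource(source);
        boolean more = true;
        int steps = 0;
        while (more && steps < maxSteps) {
            more = config.parse(false);
            steps++;
        }
        if (more) {
            // Parsing was terminated early, so resources are released explicitly.
            config.cleanup();
        }
        return config.events;
    }

    public static void main(String[] args) {
        System.out.println(run("a b c", 2));
        System.out.println(run("a b c", 10));
    }
}
```

The point of the pattern is that cleanup is only needed on the early-exit path: when the loop ends because parse reported no more input, the document has been consumed in full.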