| Home >> All >> com >> hexidec >> ekit >> component >> [ parser Javadoc ] |
com.hexidec.ekit.component.parser: Javadoc index of package com.hexidec.ekit.component.parser.
Package Samples:
com.hexidec.ekit.component.parser
Classes:
DocumentParser: A Parser for HTML Documents (actually, you can specify a DTD, but you should really only use this class with the html dtd in swing). Reads an InputStream of HTML and invokes the appropriate methods in the ParserCallback class. This is the default parser used by HTMLEditorKit to parse HTML url's. This will message the callback for all valid tags, as well as tags that are implied but not explicitly specified. For example, the html string (<p>blah) only has a p tag defined. The callback will see the following methods: handleStartTag(html, ...) handleStartTag(head, ...) handleEndTag(head) handleStartTag(body, ...
Parser: A simple DTD-driven HTML parser. The parser reads an HTML file from an InputStream and calls various methods (which should be overridden in a subclass) when tags and data are encountered. Unfortunately there are many badly implemented HTML parsers out there, and as a result there are many badly formatted HTML files. This parser attempts to parse most HTML files. This means that the implementation sometimes deviates from the SGML specification in favor of HTML. The parser treats \r and \r\n as \n. Newlines after starttags and before end tags are ignored just as specified in the SGML/HTML specification. ...
ResourceLoader: Simple class to load resources using the 1.2 security model. Since the html support is loaded lazily, it's resources are potentially fetched with applet code in the call stack. By providing this functionality in a class that is only built on 1.2, reflection can be used from the code that is also built on 1.1 to call this functionality (and avoid the evils of preprocessing). This functionality is called from ParserDelegator.getResourceAsStream.
ContentModelState: A content model state. This is basically a list of pointers to the BNF expression representing the model (the ContentModel). Each element in a DTD has a content model which describes the elements that may occur inside, and the order in which they can occur. Each time a token is reduced a new state is created. See Annex H on page 556 of the SGML handbook for more information.
AttributeList: This class defines the attributes of an SGML element as described in a DTD using the ATTLIST construct. An AttributeList can be obtained from the Element class using the getAttributes() method. It is actually an element in a linked list. Use the getNext() method repeatedly to enumerate all the attributes of an element.
ContentModel: A representation of a content model. A content model is basically a restricted BNF expression. It is restricted in the sense that it must be deterministic. This means that you don't have to represent it as a finite state automata. See Annex H on page 556 of the SGML handbook for more information.
TagStack: A stack of tags. Used while parsing an HTML document. It, together with the ContentModelStates, defines the complete state of the parser while reading a document. When a start tag is encountered an element is pushed onto the stack, when an end tag is enountered an element is popped of the stack.
Element: An element as described in a DTD using the ELEMENT construct. This is essentiall the description of a tag. It describes the type, content model, attributes, attribute types etc. It is used to correctly parse a document by the Parser.
DTD: The representation of an SGML DTD. DTD describes a document syntax and is used in parsing of HTML documents. It contains a list of elements and their attributes as well as a list of entities defined in the DTD.
DTDConstants: SGML constants used in a DTD. The names of the constants correspond the the equivalent SGML constructs as described in "The SGML Handbook" by Charles F. Goldfarb.
ParserDelegator: Responsible for starting up a new DocumentParser each time its parse method is invoked. Stores a reference to the dtd.
Entity: An entity is described in a DTD using the ENTITY construct. It defines the type and value of the the entity.
TagElement: A generic HTML TagElement class. The methods define how white space is interpreted around the tag.
NPrintWriter
| Home | Contact Us | Privacy Policy | Terms of Service |