| Home >> All >> org >> apache >> lenya >> [ lucene Javadoc ] |
| | org.apache.lenya.lucene.html.* (11) | | org.apache.lenya.lucene.index.* (13) | | org.apache.lenya.lucene.parser.* (9) |
org.apache.lenya.lucene: Javadoc index of package org.apache.lenya.lucene.
Package Samples:
org.apache.lenya.lucene.html
org.apache.lenya.lucene.index
org.apache.lenya.lucene.parser
Classes:
HtmlDocument: The HtmlDocument class creates a Lucene org.apache.lucene.document.Document from an HTML document. It does this by using JTidy package. It can take input input from java.io.File or java.io.InputStream .
ParseException: This exception is thrown when parse errors are encountered. You can explicitly create objects of this exception type by calling the method generateParseException in the generated parser. You can modify this class to customize your error reporting mechanisms so long as you retain the public fields.
AbstractIndexer: Abstract base class for indexers. The factory method #getDocumentCreator(String[]) is used to create a DocumentCreator from the command-line arguments.
SimpleCharStream: An implementation of interface CharStream, where the stream is assumed to contain only ASCII characters (without unicode processing).
PreParser: The Java HTML parser cannot handle self-closing text. This class converts all "/>" strings to ">" to avoid this problem.
DocumentCreator: An object of a class implementing this interface creates Lucene documents from files.
StringCleaner: A utility class to convert a string to text that can be included in an XML file.
HTMLParserFactory: Factory to create HTML parsers that are used for indexing HTML.
HTMLDocument: A utility for making Lucene Documents for HTML documents.
Publication: Parameters to do a search by Lucene and display results
ConfigurableDocumentCreator: Uses XSLT to transform a XML into a Lucene document
FileDocument: A utility for making Lucene Documents from a File.
IndexInformation: Helper class to hold indexing information
AbstractHTMLParser: Abstract base class for HTML parsers.
Token: Describes the input token stream.
SearchFiles: Command Line Interface
Main: DOCUMENT ME!
ReTokenizeFile: DOCUMENT ME!
| Home | Contact Us | Privacy Policy | Terms of Service |