| Home >> All >> org >> apache >> lucene >> analysis >> [ standard Javadoc ] |
org.apache.lucene.analysis.standard: Javadoc index of package org.apache.lucene.analysis.standard.
Package Samples:
org.apache.lucene.analysis.standard: API and code to convert text into indexable tokens.
Classes:
CharStream: This interface describes a character stream that maintains line and column number positions of the characters. It also has the capability to backup the stream to some extent. An implementation of this interface is used in the TokenManager implementation generated by JavaCCParser. All the methods except backup can be implemented in any fashion. backup needs to be implemented correctly for the correct operation of the lexer. Rest of the methods are all used to get information like line number, column number and the String that constitutes a token and are not used by the lexer. Hence their implementation ...
StandardAnalyzer: Filters StandardTokenizer with StandardFilter , org.apache.lucene.analysis.LowerCaseFilter and org.apache.lucene.analysis.StopFilter .
StandardTokenizer: A grammar-based tokenizer constructed with JavaCC. This should be a good tokenizer for most European-language documents. Many applications have specific tokenizer needs. If this tokenizer does not suit your application, please consider copying this source code directory to your project and maintaining your own grammar-based tokenizer.
FastCharStream: An efficient implementation of JavaCC's CharStream interface. Note that this does not do line-number counting, but instead keeps track of the character position of the token in the input, as required by Lucene's org.apache.lucene.analysis.Token API.
ParseException: This exception is thrown when parse errors are encountered. You can explicitly create objects of this exception type by calling the method generateParseException in the generated parser. You can modify this class to customize your error reporting mechanisms so long as you retain the public fields.
StandardFilter: Normalizes tokens extracted with StandardTokenizer .
Token: Describes the input token stream.
StandardTokenizerConstants
StandardTokenizerTokenManager
TokenMgrError
| Home | Contact Us | Privacy Policy | Terms of Service |