Docjar: A Java Source and Docuemnt Enginecom.*    java.*    javax.*    org.*    all    new    plug-in

Quick Search    Search Deep

Package org.apache.lucene.analysis

Class Summary
Analyzer An Analyzer builds TokenStreams, which analyze text.
CharTokenizer An abstract base class for simple, character-oriented tokenizers.
LetterTokenizer A LetterTokenizer is a tokenizer that divides text at non-letters.
LowerCaseFilter Normalizes token text to lower case.
LowerCaseTokenizer LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together.
PerFieldAnalyzerWrapper This analyzer is used to facilitate scenarios where different fields require different analysis techniques.
PorterStemFilter Transforms the token stream as per the Porter stemming algorithm.
PorterStemmer Stemmer, implementing the Porter Stemming Algorithm The Stemmer class transforms a word into its root form.
SimpleAnalyzer An Analyzer that filters LetterTokenizer with LowerCaseFilter.
StopAnalyzer Filters LetterTokenizer with LowerCaseFilter and StopFilter.
StopFilter Removes stop words from a token stream.
TestAnalyzers  
TestPerFieldAnalzyerWrapper Copyright 2004 The Apache Software Foundation Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License.
TestStopAnalyzer  
Token A Token is an occurence of a term from the text of a field.
TokenFilter A TokenFilter is a TokenStream whose input is another token stream.
Tokenizer A Tokenizer is a TokenStream whose input is a Reader.
TokenStream A TokenStream enumerates the sequence of tokens, either from fields of a document or from query text.
WhitespaceAnalyzer An Analyzer that uses WhitespaceTokenizer.
WhitespaceTokenizer A WhitespaceTokenizer is a tokenizer that divides text at whitespace.