org.apache.lucene.analysis.cz
public final class CzechAnalyzer
java.lang.Object
   org.apache.lucene.analysis.Analyzer
      org.apache.lucene.analysis.cz.CzechAnalyzer
Analyzer for the Czech language. Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified; the exclusion list is empty by default.
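To make "words that will not be indexed at all" concrete, the following is a minimal sketch in plain Java of what stopword removal amounts to. It is not the Lucene implementation: the class StopwordFilterSketch, its method indexableTerms, and the simple split-and-lowercase tokenization are all assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; this class is not part of Lucene.
final class StopwordFilterSketch {

    // Returns the lowercased tokens of text that are NOT in stopwords,
    // i.e. the terms that would remain available for indexing.
    static List<String> indexableTerms(String text, Set<String> stopwords) {
        List<String> terms = new ArrayList<>();
        // Assumed tokenization: split on runs of non-letter, non-digit characters.
        for (String token : text.split("[^\\p{L}\\p{N}]+")) {
            if (token.isEmpty()) {
                continue; // a leading separator yields one empty token
            }
            String term = token.toLowerCase(Locale.ROOT);
            if (!stopwords.contains(term)) {
                terms.add(term);
            }
        }
        return terms;
    }
}
```

With a stop set containing "a" and "je", the input "Praha je mesto a hrad" would keep only "praha", "mesto" and "hrad". The stop set here is supplied by the caller and stands in for either the default list or an alternative one.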
Field Summary
public static final  String[] CZECH_STOP_WORDS    List of typical stopwords. 
Constructors:
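 public CzechAnalyzer() 
    Builds an analyzer with the default stop words (CZECH_STOP_WORDS).
 public CzechAnalyzer(String[] stopwords) 
    Builds an analyzer with the given stop words.
 public CzechAnalyzer(HashSet stopwords) 
    Builds an analyzer with the given stop words.
 public CzechAnalyzer(File stopwords) throws IOException 
    Builds an analyzer with the stop words read from the given file.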
Methods from org.apache.lucene.analysis.cz.CzechAnalyzer:
loadStopWords,   tokenStream
Methods from org.apache.lucene.analysis.Analyzer:
getPositionIncrementGap,   getPreviousTokenStream,   reusableTokenStream,   setPreviousTokenStream,   tokenStream
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method Detail for org.apache.lucene.analysis.cz.CzechAnalyzer:
 public void loadStopWords(InputStream wordfile, String encoding) 
    Loads the stopword table from a resource stream (file, database, ...), decoding it with the given encoding.
 public final TokenStream tokenStream(String fieldName, Reader reader) 
    Creates a TokenStream which tokenizes all the text in the provided Reader.
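As an illustration of the kind of work loadStopWords performs, here is a minimal sketch in plain Java that reads a stopword set from an InputStream using a named encoding. It is not the Lucene source. The class StopwordLoaderSketch and its load method are hypothetical, and two behaviours are assumptions of the sketch rather than documented facts: the word list holds one word per line, and a null encoding falls back to the platform default.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration only; this class is not part of Lucene.
final class StopwordLoaderSketch {

    // Reads one stopword per line (assumed format) from wordfile.
    // A null encoding means "use the platform default" (assumption of this sketch).
    static Set<String> load(InputStream wordfile, String encoding) throws IOException {
        Set<String> words = new HashSet<>();
        InputStreamReader decoder = (encoding == null)
                ? new InputStreamReader(wordfile)
                : new InputStreamReader(wordfile, encoding);
        // try-with-resources closes the reader and the underlying stream.
        try (BufferedReader reader = new BufferedReader(decoder)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word); // skip blank lines, ignore surrounding spaces
                }
            }
        }
        return words;
    }
}
```

Passing the encoding explicitly matters for Czech word lists, because accented characters decode differently under encodings such as ISO-8859-2 and UTF-8. An unsupported encoding name surfaces as an IOException subtype from the InputStreamReader constructor.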