Save This Page
Home » lucene-2.3.2-src » org.apache » lucene » analysis » nl » [javadoc | source]
org.apache.lucene.analysis.nl
public class: DutchAnalyzer [javadoc | source]
java.lang.Object
   org.apache.lucene.analysis.Analyzer
      org.apache.lucene.analysis.nl.DutchAnalyzer
Analyzer for Dutch language. Supports an external list of stopwords (words that will not be indexed at all), an external list of exclusions (word that will not be stemmed, but indexed) and an external list of word-stem pairs that overrule the algorithm (dictionary stemming). A default set of stopwords is used unless an alternative list is specified, the exclusion list is empty by default.
Field Summary
public static final  String[] DUTCH_STOP_WORDS    List of typical Dutch stopwords. 
Constructor:
 public DutchAnalyzer() 
 public DutchAnalyzer(String[] stopwords) 
    Builds an analyzer with the given stop words.
    Parameters:
    stopwords -
 public DutchAnalyzer(HashSet stopwords) 
    Builds an analyzer with the given stop words.
    Parameters:
    stopwords -
 public DutchAnalyzer(File stopwords) 
    Builds an analyzer with the given stop words.
    Parameters:
    stopwords -
Method from org.apache.lucene.analysis.nl.DutchAnalyzer Summary:
setStemDictionary,   setStemExclusionTable,   setStemExclusionTable,   setStemExclusionTable,   tokenStream
Methods from org.apache.lucene.analysis.Analyzer:
getPositionIncrementGap,   getPreviousTokenStream,   reusableTokenStream,   setPreviousTokenStream,   tokenStream
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.lucene.analysis.nl.DutchAnalyzer Detail:
 public  void setStemDictionary(File stemdictFile) 
    Reads a stemdictionary file , that overrules the stemming algorithm This is a textfile that contains per line word\tstem, i.e: two tab seperated words
 public  void setStemExclusionTable(String[] exclusionlist) 
    Builds an exclusionlist from an array of Strings.
 public  void setStemExclusionTable(HashSet exclusionlist) 
    Builds an exclusionlist from a Hashtable.
 public  void setStemExclusionTable(File exclusionlist) 
    Builds an exclusionlist from the words contained in the given file.
 public TokenStream tokenStream(String fieldName,
    Reader reader) 
    Creates a TokenStream which tokenizes all the text in the provided TextReader.