Save This Page
Home » lucene-2.3.2-src » org.apache » lucene » analysis » cn » [javadoc | source]
org.apache.lucene.analysis.cn
public final class: ChineseFilter [javadoc | source]
java.lang.Object
   org.apache.lucene.analysis.TokenStream
      org.apache.lucene.analysis.TokenFilter
         org.apache.lucene.analysis.cn.ChineseFilter
Title: ChineseFilter Description: Filter with a stop word table Rule: No digital is allowed. English word/token should larger than 1 character. One Chinese character as one Chinese word. TO DO: 1. Add Chinese stop words, such as \ue400 2. Dictionary based Chinese word extraction 3. Intelligent Chinese word extraction Copyright: Copyright (c) 2001 Company:
Field Summary
public static final  String[] STOP_WORDS     
Fields inherited from org.apache.lucene.analysis.TokenFilter:
input
Constructor:
 public ChineseFilter(TokenStream in) 
Method from org.apache.lucene.analysis.cn.ChineseFilter Summary:
next
Methods from org.apache.lucene.analysis.TokenFilter:
close
Methods from org.apache.lucene.analysis.TokenStream:
close,   next,   next,   reset
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.lucene.analysis.cn.ChineseFilter Detail:
 public final Token next() throws IOException