java.lang.ObjectTitle: ChineseFilter Description: Filter with a stop word table Rule: No digital is allowed. English word/token should larger than 1 character. One Chinese character as one Chinese word. TO DO: 1. Add Chinese stop words, such as \ue400 2. Dictionary based Chinese word extraction 3. Intelligent Chinese word extraction Copyright: Copyright (c) 2001 Company:org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.cn.ChineseFilter
Yiyi - Sun1.0 - | Field Summary | ||
|---|---|---|
| public static final String[] | STOP_WORDS | |
| Fields inherited from org.apache.lucene.analysis.TokenFilter: |
|---|
| input |
| Constructor: |
|---|
|
| Method from org.apache.lucene.analysis.cn.ChineseFilter Summary: |
|---|
| next |
| Methods from org.apache.lucene.analysis.TokenFilter: |
|---|
| close |
| Methods from org.apache.lucene.analysis.TokenStream: |
|---|
| close, next, next, reset |
| Methods from java.lang.Object: |
|---|
| equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method from org.apache.lucene.analysis.cn.ChineseFilter Detail: |
|---|
|