Save This Page
Home » nutch-1.0 » org.apache.nutch » analysis » [javadoc | source]
org.apache.nutch.analysis
abstract public class: NutchAnalyzer [javadoc | source]
java.lang.Object
   org.apache.lucene.analysis.Analyzer
      org.apache.nutch.analysis.NutchAnalyzer

All Implemented Interfaces:
    Pluggable, org.apache.hadoop.conf.Configurable

Direct Known Subclasses:
    GermanAnalyzer, FrenchAnalyzer, NutchDocumentAnalyzer

Extension point for analysis. All plugins found which implement this extension point are run sequentially on the parse.
Field Summary
static final  String X_POINT_ID    The name of the extension point. 
protected  Configuration conf    The current Configuration 
Method from org.apache.nutch.analysis.NutchAnalyzer Summary:
getConf,   setConf,   tokenStream
Methods from org.apache.lucene.analysis.Analyzer:
getPositionIncrementGap,   reusableTokenStream,   tokenStream
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.analysis.NutchAnalyzer Detail:
 public Configuration getConf() 
 public  void setConf(Configuration conf) 
 abstract public TokenStream tokenStream(String fieldName,
    Reader reader)
    Creates a TokenStream which tokenizes all the text in the provided Reader.