Save This Page
Home » nutch-1.0 » org.apache.nutch.urlfilter » regex » [javadoc | source]
org.apache.nutch.urlfilter.regex
public class: RegexURLFilter [javadoc | source]
java.lang.Object
   org.apache.nutch.urlfilter.api.RegexURLFilterBase
      org.apache.nutch.urlfilter.regex.RegexURLFilter

All Implemented Interfaces:
    URLFilter

Filters URLs based on a file of regular expressions using the Java Regex implementation .
Constructor:
 public RegexURLFilter() 
 public RegexURLFilter(String filename) throws PatternSyntaxException, IOException 
 RegexURLFilter(Reader reader) throws IllegalArgumentException, IOException 
Method from org.apache.nutch.urlfilter.regex.RegexURLFilter Summary:
createRule,   getRulesFile,   main
Methods from org.apache.nutch.urlfilter.api.RegexURLFilterBase:
createRule,   filter,   getConf,   getRulesFile,   main,   setConf
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.urlfilter.regex.RegexURLFilter Detail:
 protected RegexRule createRule(boolean sign,
    String regex) 
 protected String getRulesFile(Configuration conf) 
 public static  void main(String[] args) throws IOException