org.apache.nutch.net
public interface: URLFilter

All Superinterfaces:
    org.apache.hadoop.conf.Configurable, Pluggable

All Known Implementing Classes:
    DomainURLFilter, SuffixURLFilter, RegexURLFilterBase, UrlValidator, PrefixURLFilter, Subcollection, RegexURLFilter, AutomatonURLFilter

Interface used to limit which URLs enter Nutch. Used by the injector and the db updater.
Field Summary
public static final  String X_POINT_ID    The name of the extension point. 
Method from org.apache.nutch.net.URLFilter Summary:
filter
Method from org.apache.nutch.net.URLFilter Detail:
 public String filter(String urlString)
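
The method detail above gives only the signature. As commonly understood for Nutch URL filters, `filter` returns the URL (possibly unchanged) when it is accepted and `null` when it is rejected; treat that contract as an assumption here, since this page does not state it. The sketch below illustrates the idea without depending on Nutch or Hadoop: `SimpleUrlFilter` is a hypothetical stand-in for `URLFilter` reduced to its single method (the real interface also extends `Configurable` and `Pluggable`), and `SchemeFilter` and `applyAll` are illustrative names, not part of the Nutch API.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class UrlFilterSketch {

    // Hypothetical stand-in for org.apache.nutch.net.URLFilter, keeping only
    // the filter method. Assumed contract: return the URL to accept it,
    // return null to reject it.
    interface SimpleUrlFilter {
        String filter(String urlString);
    }

    // Illustrative filter: accepts only http and https URLs.
    static class SchemeFilter implements SimpleUrlFilter {
        @Override
        public String filter(String urlString) {
            if (urlString == null) {
                return null;
            }
            String lower = urlString.toLowerCase(Locale.ROOT);
            boolean accepted = lower.startsWith("http://") || lower.startsWith("https://");
            return accepted ? urlString : null;
        }
    }

    // Runs a URL through each filter in order and stops at the first rejection.
    static String applyAll(List<SimpleUrlFilter> filters, String urlString) {
        String current = urlString;
        for (SimpleUrlFilter f : filters) {
            if (current == null) {
                break;
            }
            current = f.filter(current);
        }
        return current;
    }

    public static void main(String[] args) {
        // A lambda works because the stand-in interface has a single method.
        SimpleUrlFilter noJpeg = url -> url.endsWith(".jpg") ? null : url;
        List<SimpleUrlFilter> chain = Arrays.asList(new SchemeFilter(), noJpeg);

        String[] candidates = {
            "http://example.com/index.html",
            "ftp://example.com/file.txt",
            "https://example.com/photo.jpg"
        };
        for (String url : candidates) {
            String result = applyAll(chain, url);
            System.out.println(url + " -> " + (result == null ? "rejected" : "accepted"));
        }
    }
}
```

Chaining filters so that the first `null` short-circuits the rest is one plausible way for a caller such as an injector to combine several implementations; how Nutch itself orders and applies them is not described on this page.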