Save This Page
Home » nutch-1.0 » org.apache.nutch » net » [javadoc | source]
org.apache.nutch.net
public interface: URLNormalizer [javadoc | source]

All Implemented Interfaces:
    org.apache.hadoop.conf.Configurable

All Known Implementing Classes:
    BasicURLNormalizer, RegexURLNormalizer, PassURLNormalizer

Interface used to convert URLs to normal form and optionally perform substitutions
Field Summary
public static final  String X_POINT_ID     
Method from org.apache.nutch.net.URLNormalizer Summary:
normalize
Method from org.apache.nutch.net.URLNormalizer Detail:
 public String normalize(String urlString,
    String scope) throws MalformedURLException