org.apache.nutch.crawl
public class: MD5Signature
java.lang.Object
   org.apache.nutch.crawl.Signature
      org.apache.nutch.crawl.MD5Signature

All Implemented Interfaces:
    org.apache.hadoop.conf.Configurable

Default implementation of a page signature. It calculates an MD5 hash of the raw binary content of a page. If the page has no content, it calculates the hash from the page's URL instead.
Fields inherited from org.apache.nutch.crawl.Signature:
conf
Method from org.apache.nutch.crawl.MD5Signature Summary:
calculate
Methods from org.apache.nutch.crawl.Signature:
calculate,   getConf,   setConf
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.crawl.MD5Signature Detail:
 public byte[] calculate(Content content,
    Parse parse)
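
The behavior described in the class summary (hash the raw content, fall back to the URL when there is no content) can be sketched with the JDK alone. This is an illustrative sketch, not the Nutch source: the class Md5SignatureSketch, its calculate(byte[], String) signature, and the toHex helper are hypothetical stand-ins for the Content and Parse arguments, and treating a null array as "no content" and encoding the URL as UTF-8 are assumptions.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical stand-in for MD5Signature; it does not depend on Nutch or Hadoop.
final class Md5SignatureSketch {

    // Returns the 16-byte MD5 digest of the raw content, or of the URL
    // when there is no content (assumed here to mean a null array).
    static byte[] calculate(byte[] rawContent, String url) {
        byte[] data = rawContent;
        if (data == null) {
            // Assumption: the URL is hashed as UTF-8 bytes.
            data = url.getBytes(StandardCharsets.UTF_8);
        }
        try {
            return MessageDigest.getInstance("MD5").digest(data);
        } catch (NoSuchAlgorithmException e) {
            // Every compliant Java platform is required to provide MD5.
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    // Renders a digest as lowercase hexadecimal for display or comparison.
    static String toHex(byte[] digest) {
        StringBuilder sb = new StringBuilder(digest.length * 2);
        for (byte b : digest) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] page = "hello".getBytes(StandardCharsets.UTF_8);
        // Signature of a page with content.
        System.out.println(toHex(calculate(page, "http://example.com/")));
        // Signature of a page without content: derived from the URL.
        System.out.println(toHex(calculate(null, "http://example.com/")));
    }
}
```

Two pages with byte-identical content produce the same signature regardless of URL, which is what makes a signature of this kind usable for exact-duplicate detection. Any single-byte difference in the content yields a different digest.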