Save This Page
Home » nutch-1.0 » org.apache.nutch » parse » [javadoc | source]
org.apache.nutch.parse
public class: HTMLMetaTags [javadoc | source]
java.lang.Object
   org.apache.nutch.parse.HTMLMetaTags
This class holds the information about HTML "meta" tags extracted from a page. Some special tags have convenience methods for easy checking.
Method from org.apache.nutch.parse.HTMLMetaTags Summary:
getBaseHref,   getGeneralTags,   getHttpEquivTags,   getNoCache,   getNoFollow,   getNoIndex,   getRefresh,   getRefreshHref,   getRefreshTime,   reset,   setBaseHref,   setNoCache,   setNoFollow,   setNoIndex,   setRefresh,   setRefreshHref,   setRefreshTime,   toString
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.parse.HTMLMetaTags Detail:
 public URL getBaseHref() 
    A convenience method. Returns the baseHref, if set, or null otherwise.
 public Properties getGeneralTags() 
    Returns all collected values of the general meta tags. Property names are tag names, property values are "content" values.
 public Properties getHttpEquivTags() 
    Returns all collected values of the "http-equiv" meta tags. Property names are tag names, property values are "content" values.
 public boolean getNoCache() 
    A convenience method. Returns the current value of noCache.
 public boolean getNoFollow() 
    A convenience method. Returns the current value of noFollow.
 public boolean getNoIndex() 
    A convenience method. Returns the current value of noIndex.
 public boolean getRefresh() 
    A convenience method. Returns the current value of refresh.
 public URL getRefreshHref() 
    A convenience method. Returns the refreshHref, if set, or null otherwise. The value may be invalid if #getRefresh() returns false.
 public int getRefreshTime() 
    A convenience method. Returns the current value of refreshTime. The value may be invalid if #getRefresh() returns false.
 public  void reset() 
    Sets all boolean values to false. Clears all other tags.
 public  void setBaseHref(URL baseHref) 
    Sets the baseHref.
 public  void setNoCache() 
    Sets noCache to true.
 public  void setNoFollow() 
    Sets noFollow to true.
 public  void setNoIndex() 
    Sets noIndex to true.
 public  void setRefresh(boolean refresh) 
    Sets refresh to the supplied value.
 public  void setRefreshHref(URL refreshHref) 
    Sets the refreshHref.
 public  void setRefreshTime(int refreshTime) 
    Sets the refreshTime.
 public String toString()