Docjar: A Java Source and Document Engine

org.htmlparser.tests.scannersTests
Class LinkScannerTest

java.lang.Object
  extended by junit.framework.Assert
      extended by junit.framework.TestCase
          extended by org.htmlparser.tests.ParserTestCase
              extended by org.htmlparser.tests.scannersTests.LinkScannerTest
All Implemented Interfaces:
junit.framework.Test

public class LinkScannerTest
extends org.htmlparser.tests.ParserTestCase


Field Summary
 
Fields inherited from class org.htmlparser.tests.ParserTestCase
node, nodeCount, parser, reader
 
Fields inherited from class junit.framework.TestCase
 
Constructor Summary
LinkScannerTest(java.lang.String name)
           
 
Method Summary
 void _testLinkWithJSP()
          Bug report by Cory Seefurth
 void testAccessKey()
           
 void testBadImageInLinkBug()
           
 void testBaseRefLink()
           
 void testBrokenLink()
          Test suggested by Cedric Rosa. A really bad link tag sends the parser into an infinite loop.
 void testErroneousLinkBug()
           
 void testErroneousLinkBugFromYahoo()
          Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions.
 void testErroneousLinkBugFromYahoo2()
          Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions.
 void testEvaluate()
           
 void testExtractLinkInvertedCommasBug()
          This is the reproduction of a bug which causes a null pointer exception
 void testExtractLinkInvertedCommasBug2()
          This is the reproduction of a bug which produces multiple text copies.
 void testFreshMeatBug()
          A bug in the freshmeat page: a really bad HTML tag, <A>Revision<\a>. Reported by Mazlan Mat.
 void testFTPLink()
           
 void testInsertEndTagBeforeTag()
           
 void testJavascriptLink()
           
 void testLinkContainsEqualTo()
          This is an attempt to reproduce bug 677874 reported by James Moliere.
 void testLinkDataContents()
           
 void testLinkScannerFilter()
           
 void testLinkSpacesBug()
          Bug pointed out by Sam Joseph (sam@neurogrid.net). Links with spaces in them get their spaces absorbed.
 void testMailtoLink()
           
 void testMultipleLineBug()
          Bug reported by Raj Sharma, 5-Apr-2002: upon parsing http://www.samachar.com, the entire page could not be picked up.
 void testNotFTPLink()
           
 void testNotJavascriptLink()
           
 void testNotMailtoLink()
           
 void testQueryLink()
          This is a reproduction of bug 617228, reported by Stephen J. Harrington.
 void testRelativeLinkNotHTMLBug()
           
 void testRelativeLinkScan()
           
 void testRelativeLinkScan2()
           
 void testRelativeLinkScan3()
           
 void testReplaceFaultyTagWithEndTag()
           
 void testScan()
          Test scan with data containing different node types.
 void testTagSymbolsInLinkText()
           
 
Methods inherited from class org.htmlparser.tests.ParserTestCase
assertHiddenIDTagPresent, assertNodeCount, assertNodeCount, assertSameType, assertStringEquals, assertTagEquals, assertType, assertXmlEquals, createParser, createParser, createParser, createParser, parse, parseAndAssertNodeCount, parseNodes, removeEscapeCharacters
 
Methods inherited from class junit.framework.TestCase
countTestCases, createResult, getName, run, run, runBare, runTest, setName, setUp, tearDown, toString
 
Methods inherited from class junit.framework.Assert
assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertEquals, assertFalse, assertFalse, assertNotNull, assertNotNull, assertNotSame, assertNotSame, assertNull, assertNull, assertSame, assertSame, assertTrue, assertTrue, fail, fail
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

LinkScannerTest

public LinkScannerTest(java.lang.String name)

Method Detail

testAccessKey

public void testAccessKey()
                   throws org.htmlparser.util.ParserException

testErroneousLinkBug

public void testErroneousLinkBug()
                          throws org.htmlparser.util.ParserException

testErroneousLinkBugFromYahoo2

public void testErroneousLinkBugFromYahoo2()
                                    throws org.htmlparser.util.ParserException
Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions. Found to occur on the following piece of HTML:
 
  <a href=s/8741><img src="http://us.i1.yimg.com/us.yimg.com/i/i16/mov_popc.gif" height=16 width=16 border=0></img>
 
   
  <a href=s/7509>
  
 


testErroneousLinkBugFromYahoo

public void testErroneousLinkBugFromYahoo()
                                   throws org.htmlparser.util.ParserException
Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions. Found to occur on the following piece of HTML:
 
  <a href=s/8741><img src="http://us.i1.yimg.com/us.yimg.com/i/i16/mov_popc.gif" height=16 width=16 border=0></img>This is test
  <a href=s/7509>
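
The snippet above opens a link and then opens a second one without ever closing the first. A minimal sketch of a link-body scan that always terminates on such input is shown here: the body ends at `</a>`, at the next `<a `, or at the end of the input, whichever comes first. This is an illustration using only the JDK, not the scanner that LinkScannerTest exercises; the class `BoundedLinkScan` and the method `linkBodies` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of org.htmlparser.
final class BoundedLinkScan {

    /** Returns the text between each opening link tag and the point where that link ends. */
    static List<String> linkBodies(String html) {
        // Lower-case copy so that <A ...> and <a ...> are both found.
        String lower = html.toLowerCase(Locale.ROOT);
        List<String> bodies = new ArrayList<>();
        int pos = 0;
        while (true) {
            int open = lower.indexOf("<a ", pos);
            if (open < 0) break;                       // no more links
            int bodyStart = lower.indexOf('>', open);
            if (bodyStart < 0) break;                  // opening tag never ends
            bodyStart++;
            int close = lower.indexOf("</a>", bodyStart);
            int nextOpen = lower.indexOf("<a ", bodyStart);
            // The body stops at the earliest of: </a>, the next <a, end of input.
            int end = lower.length();
            if (close >= 0) end = Math.min(end, close);
            if (nextOpen >= 0) end = Math.min(end, nextOpen);
            bodies.add(html.substring(bodyStart, end));
            pos = end;                                 // always moves past 'open'
        }
        return bodies;
    }

    public static void main(String[] args) {
        String html = "<a href=s/8741><img src=\"http://us.i1.yimg.com/us.yimg.com/i/i16/mov_popc.gif\""
                + " height=16 width=16 border=0></img>This is test\n<a href=s/7509>";
        for (String body : linkBodies(html)) {
            System.out.println("[" + body + "]");
        }
    }
}
```

Because `pos` advances past the opening tag on every iteration, the loop cannot revisit the same link, which is the property the out-of-memory report calls for.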
  
 


testEvaluate

public void testEvaluate()

testExtractLinkInvertedCommasBug

public void testExtractLinkInvertedCommasBug()
                                      throws org.htmlparser.util.ParserException
This is the reproduction of a bug which causes a null pointer exception


testExtractLinkInvertedCommasBug2

public void testExtractLinkInvertedCommasBug2()
                                       throws org.htmlparser.util.ParserException
This is the reproduction of a bug which produces multiple text copies.


testLinkSpacesBug

public void testLinkSpacesBug()
                       throws org.htmlparser.util.ParserException
Bug pointed out by Sam Joseph (sam@neurogrid.net). Links with spaces in them get their spaces absorbed.


testMultipleLineBug

public void testMultipleLineBug()
                         throws org.htmlparser.util.ParserException
Bug reported by Raj Sharma, 5-Apr-2002: upon parsing http://www.samachar.com, the entire page could not be picked up. The problem occurred after one particular link, beyond which parsing would not proceed. This link was spread over three lines. The bug has been reproduced and fixed.
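
To illustrate why a link spread over three lines is awkward, here is a small sketch that extracts `href` values with a regular expression, once over the whole text and once line by line. It only demonstrates the failure mode and is not the parser's implementation; the class `MultiLineLink`, its methods, and the example URL are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of org.htmlparser.
final class MultiLineLink {

    // "\\s" and "[^>]" both match line breaks, so the tag may span several lines.
    private static final Pattern LINK = Pattern.compile(
            "<a\\s[^>]*?href\\s*=\\s*\"([^\"]*)\"[^>]*>", Pattern.CASE_INSENSITIVE);

    /** Collects href values from link tags anywhere in the text. */
    static List<String> hrefs(String text) {
        List<String> found = new ArrayList<>();
        Matcher m = LINK.matcher(text);
        while (m.find()) {
            found.add(m.group(1));
        }
        return found;
    }

    /** Same extraction, but each line is examined in isolation. */
    static List<String> hrefsLineByLine(String text) {
        List<String> found = new ArrayList<>();
        for (String line : text.split("\n")) {
            found.addAll(hrefs(line));
        }
        return found;
    }

    public static void main(String[] args) {
        // A link tag spread over three lines (example URL is made up).
        String page = "before <a\n  href=\"http://www.example.com/news\"\n  target=\"_top\">News</a> after";
        System.out.println("whole text:   " + hrefs(page));
        System.out.println("line by line: " + hrefsLineByLine(page));
    }
}
```

The line-by-line variant never sees a complete tag, so it finds nothing, while the whole-text variant finds the link.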


testRelativeLinkScan

public void testRelativeLinkScan()
                          throws org.htmlparser.util.ParserException

testRelativeLinkScan2

public void testRelativeLinkScan2()
                           throws org.htmlparser.util.ParserException

testRelativeLinkScan3

public void testRelativeLinkScan3()
                           throws org.htmlparser.util.ParserException

testScan

public void testScan()
              throws org.htmlparser.util.ParserException
Test scan with data containing different node types.


testReplaceFaultyTagWithEndTag

public void testReplaceFaultyTagWithEndTag()
                                    throws org.htmlparser.util.ParserException

testInsertEndTagBeforeTag

public void testInsertEndTagBeforeTag()
                               throws org.htmlparser.util.ParserException

testFreshMeatBug

public void testFreshMeatBug()
                      throws org.htmlparser.util.ParserException
A bug in the freshmeat page: a really bad HTML tag, <A>Revision<\a>. Reported by Mazlan Mat.


testBrokenLink

public void testBrokenLink()
                    throws org.htmlparser.util.ParserException
Test suggested by Cedric Rosa. A really bad link tag sends the parser into an infinite loop.


testLinkDataContents

public void testLinkDataContents()
                          throws org.htmlparser.util.ParserException

testBaseRefLink

public void testBaseRefLink()
                     throws org.htmlparser.util.ParserException

testQueryLink

public void testQueryLink()
                   throws org.htmlparser.util.ParserException
This is a reproduction of bug 617228, reported by Stephen J. Harrington. When faced with a link like: <A HREF="/cgi-bin/view_search?query_text=postdate>20020701&txt_clr=White&bg_clr=Red&url=http://loc al host/Testing/Report 1.html">20020702 Report 1</A> the parser is unable to handle the link correctly because the greater-than symbol inside the attribute value is mistaken for the end of the tag.
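
The confusion described above can be shown with a short sketch that looks for the end of a tag in two ways: a naive search for the first `>`, and a scan that ignores `>` while inside a quoted attribute value. This is an illustration only and makes no claim about how the parser was fixed; the class `QuoteAwareTagEnd` and the method `findTagEnd` are hypothetical, and the link in `main` is a shortened form of the one in the bug report.

```java
// Hypothetical helper, not part of org.htmlparser.
final class QuoteAwareTagEnd {

    /**
     * Returns the index of the '>' that closes the tag beginning at 'start',
     * skipping any '>' inside a single- or double-quoted value; -1 if none.
     */
    static int findTagEnd(String html, int start) {
        char quote = 0; // 0 means "not inside a quoted value"
        for (int i = start; i < html.length(); i++) {
            char c = html.charAt(i);
            if (quote != 0) {
                if (c == quote) quote = 0;     // closing quote
            } else if (c == '"' || c == '\'') {
                quote = c;                     // opening quote
            } else if (c == '>') {
                return i;                      // real end of tag
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // Shortened version of the link from the bug report.
        String html = "<A HREF=\"/cgi-bin/view_search?query_text=postdate>20020701&txt_clr=White\">"
                + "20020702 Report 1</A>";
        int naive = html.indexOf('>');
        int aware = findTagEnd(html, 0);
        System.out.println("naive: " + html.substring(0, naive + 1));
        System.out.println("aware: " + html.substring(0, aware + 1));
    }
}
```

The naive search cuts the tag off in the middle of the HREF value, whereas the quote-aware scan returns the whole opening tag.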


testNotMailtoLink

public void testNotMailtoLink()
                       throws org.htmlparser.util.ParserException

testMailtoLink

public void testMailtoLink()
                    throws org.htmlparser.util.ParserException

testJavascriptLink

public void testJavascriptLink()
                        throws org.htmlparser.util.ParserException

testNotJavascriptLink

public void testNotJavascriptLink()
                           throws org.htmlparser.util.ParserException

testFTPLink

public void testFTPLink()
                 throws org.htmlparser.util.ParserException

testNotFTPLink

public void testNotFTPLink()
                    throws org.htmlparser.util.ParserException

testRelativeLinkNotHTMLBug

public void testRelativeLinkNotHTMLBug()
                                throws org.htmlparser.util.ParserException

testBadImageInLinkBug

public void testBadImageInLinkBug()
                           throws org.htmlparser.util.ParserException

testLinkContainsEqualTo

public void testLinkContainsEqualTo()
                             throws java.lang.Exception
This is an attempt to reproduce bug 677874 reported by James Moliere. A link tag whose text is "Shoe bomber handed life sentence" is not parsed correctly. The second '=' sign in the link causes the parser to treat what follows as a separate attribute.
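
One way to picture the problem: an attribute such as an HREF whose URL carries a query string contains two '=' signs, and only the first separates the name from the value. The sketch below splits on the first '=' only. It is an assumption-laden illustration, not the parser's code; the class `AttributeSplit`, the method `splitAttribute`, and the example URL are hypothetical.

```java
// Hypothetical helper, not part of org.htmlparser.
final class AttributeSplit {

    /**
     * Splits "name=value" at the first '=' only, so any later '=' stays in the value.
     * Returns a two-element array: { name, value }, with surrounding quotes removed.
     */
    static String[] splitAttribute(String attribute) {
        int eq = attribute.indexOf('=');
        if (eq < 0) {
            return new String[] { attribute.trim(), "" }; // attribute without a value
        }
        String name = attribute.substring(0, eq).trim();
        String value = attribute.substring(eq + 1).trim();
        // Remove one pair of matching quotes, if present.
        if (value.length() >= 2) {
            char first = value.charAt(0);
            char last = value.charAt(value.length() - 1);
            if ((first == '"' || first == '\'') && first == last) {
                value = value.substring(1, value.length() - 1);
            }
        }
        return new String[] { name, value };
    }

    public static void main(String[] args) {
        // Made-up URL with a second '=' in its query string.
        String[] parts = splitAttribute("href=\"/news/story.jsp?id=42\"");
        System.out.println(parts[0] + " -> " + parts[1]);
    }
}
```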


_testLinkWithJSP

public void _testLinkWithJSP()
                      throws java.lang.Exception
Bug report by Cory Seefurth


testLinkScannerFilter

public void testLinkScannerFilter()
                           throws java.lang.Exception

testTagSymbolsInLinkText

public void testTagSymbolsInLinkText()
                              throws java.lang.Exception