Site Inspect

SiteInspect is a tool to check contents of a web site for missing files and broken links. Apart from its basic function, it also measures download speed and suggests various improvements to the web pages.

Features:

System Requirements:

Screen Shot:

SiteInspect screen shot

Sample Report:

"link status","time to load","link href","referer"
"broken",1,"http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/client.html","http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/bjs.html"
"broken",0,"http://204.4.54.200/newinf.com/service/osdc/bene/intro.html#topics","http://204.4.54.200/newinf.com/service/osdc/bene/toc.html"
"broken",0,"http://204.4.54.200/newinf.com/service/osdc/bene/intro.html#1","http://204.4.54.200/newinf.com/service/osdc/bene/toc.html"
"broken",0,"http://204.4.54.200/newinf.com/service/osdc/bene/intro.html#2","http://204.4.54.200/newinf.com/service/osdc/bene/toc.html"
"broken",0,"http://204.4.54.200/newinf.com/service/osdc/bene/intro.html#3","http://204.4.54.200/newinf.com/service/osdc/bene/toc.html"
"skipped",0,"mailto:infosys@inf.com","http://204.4.54.200/newinf.com/contact/intro.html"
"skipped",0,"mailto:marticolwell@mindspring.com","http://204.4.54.200/newinf.com/press_corner/cybershop.html"
"skipped",0,"mailto:balakv@inf.com","http://204.4.54.200/newinf.com/corpo/english/fina/fin98-99/q3_98/indiangaap.html"
"valid",2,"http://204.4.54.200/newinf.com/contact/title.html","http://204.4.54.200/newinf.com/contact/index.html"
"valid",2,"http://204.4.54.200/newinf.com/contact/toc.html","http://204.4.54.200/newinf.com/contact/index.html"
"valid",4,"http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/bjs.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"
"valid",2,"http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/copeland.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"
"valid",2,"http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/nordstrom.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"
"valid",2,"http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/nml.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"
"link status","warning message","link href","referer"
"warning","ALT tag in anchor gives baloon help in IE. ","Anchor mailto:infosys@inf.com","http://204.4.54.200/newinf.com/contact/intro.html"
"warning","ALT tag in anchor gives baloon help in IE. ","Anchor http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/bjs.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"
"warning","ALT tag in anchor gives baloon help in IE. ","Anchor http://204.4.54.200/newinf.com/corpo/english/abou/client_profile/copeland.html","http://204.4.54.200/newinf.com/corpo/english/abou/client.html"

Internal Working:

SiteInspect is a VB application. It embeds the InternetExplorer within itself and browses the internet through it. It also uses the Microsoft Internet Control (Inet Control) to do additional HTTP operations. Once a document is loaded into the embeded Internet Explorer browser, SiteInspect gets into action. It uses the Document Object Model of the HTML exposed by the browser control to run through various tags. It verifies all images, StyleSheets and Java Script files, traverses links recursively, follows the action element of HTML forms and crawls through the web site. It collects and maintains statistics for every web page as it crawls and displays them on the screen. The "Report" option saves the information into a log file. The log file can then be imported into a database and analyzed further.

Source Code and Licensing:

Copyright (C) 1999 Tanmay K. Mohapatra
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, visit  http://www.gnu.org/copyleft/gpl.html or write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.

You are free to use this code and to make modifications provided this notice is retained.
The author is not responsible for any unforseen and unintended damages this program might cause.