Analyzing Web Site Traffic
DevWebPro - For Professional Developers
Editor's Note -- 12.17.01

Hi WebPro Readers,

If the recent award of the Loebner Prize to the ALICE program is any indication of the state of artificial intelligence research, we have a long, long way to go. ALICE (Artificial Linguistic Internet Computer Entity) was ranked "most human computer" by the panel of judges. ALICE won the bronze medal and $2,000 at the annual Loebner Prize for Artificial Intelligence, held at the Science Museum in London last month. The top two medals remain unawarded. The silver medal will be given to any machine that can convince half the judges that it's human, and the gold goes to any machine that can do the same with speech instead of text. ALICE was only able to converse convincingly on one occasion during testing.

I just finished a delightful conversation with ALICE. Although she frequently responds with nonsensical answers, ALICE is good at answering simple questions, such as "What is your name?", "What is your quest?", and "What is your favorite color?". Visit the A.L.I.C.E. AI Foundation to have your own chat with ALICE.

It's going to be years before a convincing artificial entity is created. Unfortunately for the artificial intelligence community, big business has failed to find a use for a conscious artificial being. In typical capitalist fashion, businesses only want AI to give their customers better shopping experiences and to analyze huge piles of data. That means the big bucks aren't being invested in serious AI research. Not surprisingly, advances in AI might come from the computer game industry. Intelligent characters will give gamers a more immersive experience and a more realistic feel. Of course, that translates into profits for game companies.

In the latest installment from NetMechanic, Larisa Thomason gives detailed instructions on how you can use server log files to better understand your Web site's traffic patterns. Most hosting companies offer site statistics with their basic hosting packages, but if your host is still in the stone age you can download some of the analysis packages included in today's issue.

Thanks for reading,

Ryan Adkins
and the DevWebPro Team


Analyzing Web Site Traffic

Who is visiting your Web site? What browsers do they use? Where do they go in the site? What pages do they look at? Your Web server log files contain the answers to these questions and more. Once you start using your server log information, you'll wonder how you ever got along without it.

Increase Revenue - Click here

WHO GOES THERE?
Think of your server logs as a virtual visitor sign-in sheet. They record where your visitors came from and where they go on your site.

Server log files tell you:

- Which pages get the most traffic - and the least.

- What sites are referring visitors to you.

- Which pages visitors look at.

- What browsers and operating systems are most popular with visitors.

- When search engine spiders and directory editors visit.

This data can often help you flag specific problems on your Web site. If you have a lot of visitors, but few sales, check your server logs to see how many visitors actually see your product offerings. Wondering just how effective that expensive ad campaign really is? Your server logs can tell you how much traffic - and sales - it's really generating.

You can also use the data to uncover hidden problems on your site. Suppose you find that 20% of your visitors use the WebTV browser. Since sites display much differently in WebTV than in Explorer or Netscape, you'll probably need to consider optimizing your site for the WebTV browser.

READING YOUR LOG FILES
The server stores visitor information in files with the .log extension - open them as text files if you want to look at the raw log data. Unfortunately, you can't do much useful analysis with the raw data unless you're a math whiz. Here is how the log file records a single request to a Web server:

209.240.221.71 - - [03/Jan/2001:15:20:06 -0800] "GET
/Inauguration.htm HTTP/1.0" 200 8788 "http://www.democrats.com/"
"Mozilla/3.0 WebTV/1.2 (compatible; MSIE 2.0)"

HERE'S WHAT IT ALL MEANS

-SERVER LOG INFO- -COMPONENT- -NAME MEANING-

209.240.221.71 remotehost Name of the computer
requesting the Web page.

- rfc931 The name of the remote user.
This field is usually blank.

- authuser Login of the remote user.
This is also usually blank.

[03/Jan/2001:15:20:06 date Date and time of the
-0800] request.

"GET /Inauguration.htm request URL of the file requested.
HTTP/1.0" This is noted exactly as
the user requested it.

200 status Error or status code
generated by the request.

8788 bytes Size (in bytes) of the
document returned to the
client.

www.democrats.com/ referrer The URL the visitor came
from immediately before
they requested the file.

"Mozilla/3.0 WebTV/1.2 agent Records the visitor's
(compatible; MSIE 2.0)" browser and operating
system.

Hands-Free Reading! Listen to a great book now!Click Here!

The server logs provide a lot of important information: the trick is to put that data into an understandable format. If you have a large ecommerce site, then you probably already have a custom system in place. Smaller commercial or personal sites often rely on the Web site statistics programs supplied by their Web hosts.

LOG FILE ANALYSIS PACKAGES
If your Web host doesn't offer site statistics - or charges you for them - consider using some of the Web site analysis packages available for download. The CounterGuide.com Web site lists 20 packages with short reviews and product links. Packages vary widely in price and performance. Several like Analog, http Analyze, and RealTracker are free for non-commercial sites.

Analog
CounterGuide.com
http-Analyze
RealTracker

A basic package will organize the log information into several sections. At a minimum, you need to know the following:

Number of computers requesting content: This tells you how many visitors actually came to your site during a certain period - usually 24 hours. It's a much more useful gauge of site traffic than the total number of hits.

Files requested for download: The number of hits to your site. Each hit represents an individual file sent from the server - that includes image files, CGI scripts, and HTML pages. The log data should be broken out by each individual file name so you can tell exactly how many page views you have.

HTML page requests: The number of page views, which is a count of the number of times each individual Web page was requested. Ideally, you want visitors to look at more than just your home page. Divide the page view count by the number of visitors and you can get a rough estimate of how many pages each visitor actually sees.

Browser and operating system: Shows you which browser and operating system your visitors used. Use this information to decide which (if any) browser or operating system-specific elements it's safe to include on your site.

Referrer: A record of what page a visitor was at immediately before they arrived at your site. This information helps you determine which search engines send you the most traffic, the effectiveness of ad campaigns, and which links are the most active.

Note that the different software packages often use different terminology, so they may not use these exact terms. But you can easily evaluate them by paying close attention to the product specifications.

IDENTIFY WEB SITE PROBLEMS
Web site problems that completely break a Web page are pretty easy to spot, but others are more difficult to find and correct. Suppose your site gets a large number of unique visitors, but the total number of page views is small.

There could be a single problem - or a combination, such as poor site navigation, page content that isn't compelling, or design problems that keep visitors from viewing the page properly.

Improve poor site navigation
Fix design problems

NetMechanic's Web site tools can help you correct many of the problems you find during server log analysis. HTML Toolbox identifies HTML code errors and browser-specific tags that can turn away visitors while Browser Photo shows you actual snapshots of your Web page using 14 different browser and operating system combinations.

Detect HTML errors and more with NetMechanic's HTML Toolbox
Improve your designs with NetMechanic's Browser Photo

Use server log data to find problems on your site - then let NetMechanic help you fix them!

ABOUT THE AUTHOR:
Larisa Thomason is Senior Web Analyst, Specializing in Accessibility
at NetMechanic Inc., a leading developer of online maintenance,
monitoring and promotion services that has "tuned up" over 32 million
Web pages. She can be reached at larisa.thomason@netmechanic.com.
Learn more about NetMechanic tools at www.netmechanic.com.


   Free Newsletters

Freeware
WebMasterFree
LinuxWMF
MacWMF
FLADownloads
FreewareToday


Web Developers / Programmers
ColdFusionProNews
DesignNewz
DevNewz
DevWebPro
WebProNewsAU
WebProNewsCA
WebProNewsUK
FlashNewz
JavaProNews
MacProNews
TheDevWeb
ThePerlJournal
UnixProNews
WebProASP
XMLProNews


IT Managers/ Decision Makers
WebProNews
DevWebProCA
DevWebProAU
DevWebProUK
CRMProductReview
DatabaseProNews
HiTechEdge
LinuxProNews
NetworkNewz
SecurityProNews
SysAdminNews
WinXPDigest
WirelessProNews


Small Business Owners
ActivePro
ClicksToday
EcommerceTrends
EntrepreneurNewz
InsideOffice
NetDummy
promotenews
SearchNewz
SmallBusinessNewz
SohoDay
WebProWire
WomensBizNews


eBusiness Management
AdvertisingDay
CareerNewz
CRMNewz
EcommNewz
InvestNewz
ManagerNewz
MarketingNewz
NewsletterIndustry
SalesNewz
WebSiteNotes

Enter Email


Enter Postal Code


   Find Freeware

Search the WEBMASTERFREE database of free software:

   Corporate Info

Advertising
Have your ad reach thousands of webmasters and website owners.

contact:

Susan@DevWebPro.com

   Reader Tools

Archives
Check out past issues of DevWebPro. click here

Feedback
Have a gripe, comment, or idea? Let us know! click here

Print This Issue
Click here for a print version.


-- DevWebPro is an iEntry.com publication --
 © 2001 iEntry, Inc.  All Rights Reserved  Privacy Policy  Legal