Official Reference Website   Michael Schrenk

  No Starch Press,
  San Francisco

   
  What are "targets"?
Targets are the web pages (or websites) that webbots and spiders use as resources. Because web pages change frequently, we decided to use canned web pages for all the pages targeted by the examples and projects in this book.

What are the benefits of these targets?
These canned targets provide the following benefits to readers:

  1. There is no danger that these websites will change and adversely affect the operation of the example webbot projects in the book, ensuring that the book's example projects will work well into the future.
  2. These targets allow you to focus on debugging your bots instead of worrying about whether the targeted websites have changed since the book was published.
  3. These targets provide a safe place for you to experiment.
Addresses of Target Pages
Chapter | Name | URL | Notes
Introduction | n.a. | n.a. |
Chapter 1 | n.a. | n.a. |
Chapter 2 | n.a. | n.a. |
Chapter 3 | Hello World! | http://www.schrenk.com/nostarch/webbots/hello_world.html | Target for your first webbot
Chapter 4 | n.a. | n.a. |
Chapter 5 | Form example | http://www.schrenk.com/search.php | Used as an example of form emulation
Chapter 5 | Form analyzer | http://www.schrenk.com/nostarch/webbots/form_analyzer.php | Analyzes submitted forms
Chapter 6 | Sample webpage | http://www.schrenk.com | Sample webpage
Chapter 6 | Sample image | http://www.schrenk.com/north_beach.jpg | Sample image
Chapter 7 | Sample store (with redirect) | http://www.schrenk.com/webbots/example_store | Automatically redirects to the URL below.
Chapter 7 | Sample store | http://www.schrenk.com/buyair | This simple sample store is used to monitor prices. Prices change daily.
Chapter 8 | NASA Viking | http://www.nasa.gov/mission_pages/viking/index.html | This is a NASA page with many images for testing the Image Download Webbot
Chapter 9 | Hyper-reference test page | http://www.schrenk.com/nostarch/webbots/page_with_broken_links.php | This page links to pages with links in various conditions (poorly defined, broken, internal errors, etc.)
Chapter 9 | 501 Error page | http://www.schrenk.com/nostarch/webbots/501_error_page.php | Creates an HTTP 501 error
Chapter 10 | n.a. | n.a. |
Chapter 11 | Generic Search Page | http://www.schrenk.com/nostarch/webbots/search | Example (static) search engine used by the search ranking webbot
Chapter 12 | RSS Page 1 | http://www.mercurynews.com/mld/mercurynews/news/breaking_news/rss.xml | You should be able to substitute any of these pages with any other valid RSS feed. Google "RSS" for more examples.
Chapter 12 | RSS Page 2 | http://www.startribune.com/rss/1557.xml |
Chapter 12 | RSS Page 3 | http://www.mercurynews.com/mld/mercurynews/news/breaking_news/rss.xml |
Chapter 13 | n.a. | n.a. |
Chapter 14 | n.a. | n.a. |
Chapter 15 | n.a. | n.a. |
Chapter 16 | n.a. | n.a. |
Chapter 17 | Zip code form | http://www.schrenk.com/nostarch/webbots/zip_code_form.php | Example zip code finding application
Chapter 18 | n.a. | n.a. |
Chapter 19 | n.a. | n.a. |
Chapter 20 | n.a. | n.a. |
Chapter 21 | Basic Authentication Example | http://www.schrenk.com/nostarch/webbots/basic_authentication/ (NOTE: This URL redirects to http://www.webbotsspidersscreenscrapers.com/basic_authentication/) | These three URLs provide practice areas for writing auto-authenticating webbots. The user names and passwords are published in the book.
Chapter 21 | Cookie Authentication Example | http://www.schrenk.com/nostarch/webbots/cookie_authentication/ |
Chapter 21 | Query Authentication Example | http://www.schrenk.com/nostarch/webbots/query_authentication/ |
Chapter 22 | Cookie writing example | http://www.schrenk.com/nostarch/webbots/EXAMPLE_writing_cookies.php | This web page writes a temporary cookie and a permanent cookie to your browser or webbot
Chapter 23 | n.a. | n.a. |
Chapter 24 | n.a. | n.a. |
Chapter 25 | Page redirection | http://www.schrenk.com/nostarch/webbots/head_redirection_test.php | This web page performs an HTTP header redirection after a five-second delay.
Chapter 25 | Simple form | http://www.schrenk.com/nostarch/webbots/easy_form.php | This form contains hidden values, parsed by LISTING 25-9.
Chapter 26 | Sample XML | http://www.schrenk.com/nostarch/webbots/26_1.php | While a webbot doesn't care, this file would need an .XML extension to display correctly in a browser.
Chapter 26 | Example of a light-weight interface | http://www.schrenk.com/nostarch/webbots/26_3.php | This file is a webbot interface and is not intended to be read in a browser. If you are reading this file in a browser, look at the page source to see the correct formatting.
Chapter 27 | n.a. | n.a. |
Chapter 28 | n.a. | n.a. |
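
The Chapter 25 "Simple form" target is described above as a form containing hidden values. As a rough illustration of what parsing such values can look like, here is a minimal sketch in Python. It is not the book's LISTING 25-9. The names HiddenFieldParser, extract_hidden_fields, and SAMPLE_FORM are made up for this example, and the sample markup is hypothetical: it does not reproduce the real contents of easy_form.php. The sketch uses only the standard html.parser module and works on an inline string, so it runs without touching the network.

```python
from html.parser import HTMLParser


class HiddenFieldParser(HTMLParser):
    """Collects name/value pairs from <input type="hidden"> tags."""

    def __init__(self):
        super().__init__()
        self.hidden_fields = {}

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names for us.
        if tag != "input":
            return
        attributes = dict(attrs)
        # Attribute values keep their original case, so normalize the type.
        if (attributes.get("type") or "").lower() == "hidden":
            name = attributes.get("name")
            if name:
                self.hidden_fields[name] = attributes.get("value") or ""


def extract_hidden_fields(html):
    """Return a dict of hidden form field names and values found in html."""
    parser = HiddenFieldParser()
    parser.feed(html)
    parser.close()
    return parser.hidden_fields


# Hypothetical form markup (an assumption for illustration only),
# NOT the actual contents of the easy_form.php target.
SAMPLE_FORM = """
<form action="submit.php" method="post">
  <input type="hidden" name="session" value="abc123">
  <input type="HIDDEN" name="step" value="2" />
  <input type="text" name="email">
  <input type="submit" value="Send">
</form>
"""

if __name__ == "__main__":
    print(extract_hidden_fields(SAMPLE_FORM))
```

To try this against the real target, a webbot would first download the page at the "Simple form" address and pass the returned HTML to extract_hidden_fields in place of SAMPLE_FORM.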
 
Copyright 2024, Michael Schrenk