Setting up an options object for the Webserver search engine on HTTP Server

 

In the IBM HTTP Server for i5/OS, you can set up an options object for use with the Webserver search engine with the IBM Web Administration for i5/OS interface..

Information for this topic supports the latest PTF levels for HTTP Server for i5/OS . IBM recommends that you install the latest PTFs to upgrade to the latest level of the HTTP Server for i5/OS. Some of the topics documented here are not available prior to this update. See IBM Service for more information.

An options object contains values that are used when remote web sites are crawled. If you select to edit an existing object, the current values are displayed and can be changed. This object can be selected together with a URL object to use when you select to build document lists by crawling remote web sites. To create an options object, do the following:

  1. Click the Advanced tab.

  2. Click the Search Setup subtab.

  3. Expand Search Engine Setup.

  4. Click Build options object.

  5. Choose options object options:

    Create an options object

    Select this option to create a new options object. Enter the name of the new options object.

    Select an options object to edit

    Select this option to edit an existing options object. Select the options object from the list.

  6. Click Apply.

  7. Enter crawling options:

    Proxy server for HTTP

    Enter the proxy server for HTTP requests. Possible values include any valid server name.

    Proxy port for HTTP

    Enter the port number for the above proxy server. A port is required if a proxy server for HTTP is specified.

    Proxy server for HTTPS

    Enter the proxy server for HTTPS requests. Possible values include any valid server name.

    Proxy port for HTTPS

    Enter the port number for the above proxy server. A port is required if a proxy server for HTTPS is specified.

    Maximum file size to download

    Enter the maximum size for a downloaded file (in KB).

    Maximum storage for files

    Enter the maximum storage space for all downloaded files (in MB).

    Maximum threads

    Enter the maximum number of threads used during web crawling. Set this value based on the system resources that are available.

    Maximum run time

    Enter the maximum amount of time the crawling session remains active in hours and minutes.

    Activity log file

    Enter the action to take for an activity log file. This file contains information about the crawling session plus any errors that occur. This file must be in a directory of the IFS. You can choose to run a crawling session with or without an activity log file. You also have the option of replacing the log file each time a crawling session is started or appending information to the existing file.

    There are two additional options if you choose to write an activity log.

    Create or replace the logging file

    Select this option if the log file does not exist or you want to overwrite an existing log file.

    Append to the existing logging file

    Select this option to add any new information to the existing log file. This option will not delete existing information.

  8. Click Apply.

Your new options object can now be used when Web crawling remote sites.

 

Parent topic:

Search tasks