Configure the number of crawling threads 

Edit settings in the search-config.xml file to specify the maximum number of seedlist threads used when crawling. The maximum number of threads that you should specify is the number of applications that you have installed in your deployment.


Before you begin

To edit configuration files, use the IBM WAS wsadmin client. See Starting the wsadmin client for details.


About this task

By default, the maximum number of seedlist threads allowed when crawling is 2, however you can change this value by modifying the search-config.xml file.


Procedure

To update the maximum number of crawling threads that can be used when crawling...

  1. From the dmgr host:

      cd $DMGR_PROFILE/bin
      ./wsadmin.sh -jython
      execfile("searchAdmin.py")

      If prompted to specify a service to connect to, type 1 to pick the first node in the list. Most commands can run on any node. If the command writes or reads information to or from a file using a local file path, pick the node where the file is stored.

  2. Check out the Search cell-level configuration file using the following command:

      SearchCellConfig.checkOutConfig("<working_dir>", "<cellName>")

      where:

      • <working_dir> is the temporary directory to which you want to check out the cell level configuration file. This directory must exist on the server where you are running the wsadmin client. Use forward slashes to separate directories in the file path, even if you are using the Microsoft Windows operating system.

          Note: AIX and Linux only: The directory must grant write permissions or the command will not run successfully.

      • <cellName> is the name of the cell that the Search node belongs to. This argument is required. It is also case-sensitive, so type it with care. If you do not know the cell name, you can determine it by typing the following command in the wsadmin command processor:

          print AdminControl.getCell()

      For example:

      SearchCellConfig.checkOutConfig("c:/search_temp", "SearchServerNode01Cell")

  3. Use the following command:

      SearchCellConfig.setMaximumCrawlerThreads(String maxThreadNumber)

        Specifies the maximum number of seedlist threads that can be used when crawling. By default, the value is set to 2.

        This command takes a single argument that specifies the number of threads allowed.

        For example:

        SearchCellConfig.setMaximumCrawlerThreads("3")

  4. Check in the changed configuration property keys using the following wsadmin client command:

      SearchCellConfig.checkInConfig()

  5. To exit the wsadmin client, type exit at the prompt.

  6. Stop the server or servers hosting the Search application, delete the index, and then restart the Search servers.

      The next time the scheduled task runs, it recreates the index.


Parent topic

Manage the Search index


   

 

});