+

Search Tips   |   Advanced Search

Enable indexing resumption

We can add a configuration setting to search-config.xml to specify that interrupted or failed indexing tasks are automatically resumed.

SearchCellConfig.setIndexingResumptionAllowed enables the resumption of failed or interrupted indexing tasks that have not yet reached a resume point. When you enable this functionality and an indexing task fails or is interrupted, the task resumes at the start of the previous seedlist page rather than from the previous resume point.

Indexing resumption is disabled by default when we install Connections. When you run the SearchCellConfig.setIndexingResumptionAllowed command, the allowResumption setting, which specifies that interrupted or failed indexing tasks are automatically resumed, is added to the search-config.xml configuration file.

<indexSettings allowResumption="true" 
   location="${SEARCH_INDEX_DIR}" maxIndexerThreads="1"/>

You might want to consider enabling indexing resumption after installation because, if there is an interruption during initial indexing, this feature allows indexing to resume from where it left off. Normally, only crawling and file content extraction resume from where they are left off after an interruption. However, the indexing resumption feature has an impact on performance, and there is little benefit to enabling it during incremental indexing as incremental indexing typically executes very quickly.

To enable indexing resumption.

  1. Start wsadmin...

  2. Initialize the Search environment, and start the Search script interpreter:

      execfile("searchAdmin.py")

    If successful

      Search Administration initialized

  3. Check out search-config.xml :

    SearchCellConfig.checkOutConfig("working_dir", "cellName")

    where:

    • working_dir is the temporary directory to check out the cell level configuration file. This directory must exist. Use forward slashes to separate directories, even with Windows.

      AIX , and Linux only: The directory must grant write permissions or the command will not run successfully.

    • cellName is the name of the cell the Search node belongs to. Required. Case-sensitive. To determine:

      print AdminControl.getCell()

    For example:

      SearchCellConfig.checkOutConfig("c:/search_temp", "SearchServerNode01Cell")

  4. Run:

    SearchCellConfig.setIndexingResumptionAllowed(boolean allowed)

    Enable or disable resumption of interrupted or failed indexing tasks that have not reached a resume point. This command takes a single argument:

    • allowed. A boolean value.

    For example, to enable indexing resumption:

      SearchCellConfig.setIndexingResumptionAllowed("true")

  5. Check in search-config.xml:

    SearchCellConfig.checkInConfig()

  6. To exit the wsadmin client, type exit at the prompt.

  7. Stop the server or servers hosting the Search application, and then restart the Search servers. The next time the scheduled task runs, persisted seedlists are retained after indexing finishes.


Parent topic:
Index settings


Related:

Verify that Search is crawling regularly

Configure the number of crawling threads