Crawl Web content with search seedlists
Portal Search supports seedlists, which make crawling Web sites and their metadata more efficient and give content owners fine-grained control over how content and metadata are crawled.
You can configure the portal to use seedlist support when crawling content generated with IBM Lotus Web Content Management. By default, Portal Search uses seedlist format 1.0 when indexing content for search collections. When used with Web content, seedlist format 1.0 provides advantages such as integration between search results and Web content pages, as well as support for IBM OmniFind Enterprise Edition. If you still require seedlist format 0.9 for Web content, you can configure the portal to use that format instead.
- Use the search seedlist 1.0 format
By default, Portal Search is configured to support the search seedlist 1.0 format. With the seedlist 1.0 format, you can use the Web content page type to render content found in the search results on the corresponding Web content page. You can also include custom metadata fields from a Web content item that appear in the search seedlist but not in the HTML source.
- Use the search seedlist 0.9 format
Although Portal Search is configured to support the search seedlist 1.0 format by default, you can reconfigure the portal to use the standard seedlist 0.9 format when searching for Web content with the Search Center.
For example, you might choose to use seedlist format 0.9 because you want to make use of older search collections or because you retrieve the seedlist 0.9 contents using the seedlist URL, which uses a different syntax from the URL used with the search seedlist 1.0 format.
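As a rough illustration of the idea that a seedlist can carry metadata fields that never appear in the rendered HTML, the following Python sketch reads a small feed-style XML document and collects the per-entry fields. The sample document, the `urn:example:seedlist` namespace, the `field` element, and the `read_entries` helper are all hypothetical and invented for this sketch. They are not the actual seedlist 1.0 or 0.9 schema, and the URL is a placeholder.

```python
import xml.etree.ElementTree as ET

# Hypothetical seedlist-style document. The element names and the
# "urn:example:seedlist" namespace are invented for this sketch and are
# not the actual seedlist schema used by Portal Search.
SAMPLE_SEEDLIST = """\
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:ex="urn:example:seedlist">
  <entry>
    <title>Product overview</title>
    <link href="http://www.example.com/wcm/product-overview"/>
    <ex:field name="department">Marketing</ex:field>
    <ex:field name="reviewDate">2010-06-01</ex:field>
  </entry>
</feed>
"""

# Prefix-to-namespace map used by the ElementTree find methods.
NS = {"atom": "http://www.w3.org/2005/Atom", "ex": "urn:example:seedlist"}


def read_entries(xml_text):
    """Return one dict per entry with its title, link, and metadata fields."""
    root = ET.fromstring(xml_text)
    entries = []
    for entry in root.findall("atom:entry", NS):
        # Collect the custom fields that exist only in the seedlist entry.
        fields = {f.get("name"): f.text for f in entry.findall("ex:field", NS)}
        entries.append({
            "title": entry.findtext("atom:title", default="", namespaces=NS),
            "link": entry.find("atom:link", NS).get("href"),
            "metadata": fields,
        })
    return entries


entries = read_entries(SAMPLE_SEEDLIST)
```

In this sketch, a crawler that reads the entry gets the `department` and `reviewDate` values directly from the feed, without parsing the content page itself.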
Parent topic: Enable search for Web content
Previous topic: Configure Search Center to search for Web content