+

Search Tips   |   Advanced Search

Indexing WCM web content in a search collection

To search for web content, use the Portal search engine to index the content. Once been indexed, we can run searches using either Search Center or a search component. We see search results for published documents only. Unpublished pending changes in a project are not included in the results.


Create a content source for a site area

The WebSphere Portal search engine defines content sources that index the web content. All the child site areas and content items of the selected site area will be included in the index. Related content sources are grouped together in a search collection.

  1. Go to...

      Administration > Search Administration > Manage Search > Collection > New Content Source

    The default search collection is WebContentCollection.

  2. For field "Content Source Type" select: WCM site

  3. For field "Content Source Name" enter a name.

  4. For field "URL in the Collect documents linked from this URL" field, enter one of the URLs listed below. If your library name or site area names contain spaces, replace the spaces with a "+" symbol. For example, the path library one/site area one would be instead be defined as library+one/site+area+one

    • Stand-alone server:

      http://hostname:port_number/wps/seedlist/myserver?SeedlistId=library/sitearea1/childsitearea2&Source=com.ibm.workplace.wcm.plugins.seedlist.retriever.WCMRetrieverFactory&Action=GetDocuments

    • Cluster:

      http://httpserver:port_number/wps/seedlist/myserver?SeedlistId=library/sitearea1/childsitearea2&Source=com.ibm.workplace.wcm.plugins.seedlist.retriever.WCMRetrieverFactory&Action=GetDocuments

    • Virtual portal with URL Context access point:

      http://httpserver:port_number/wps/seedlist/myserver/virtualPortalContext?SeedlistId=library/sitearea1/childsitearea2&Source=com.ibm.workplace.wcm.plugins.seedlist.retriever.WCMRetrieverFactory&Action=GetDocuments

    • Virtual portal with different hostname as access point:

      http://vphostname:port_number/wps/seedlist/myserver/?SeedlistId=library/sitearea1/childsitearea2&Source=com.ibm.workplace.wcm.plugins.seedlist.retriever.WCMRetrieverFactory&Action=GetDocuments

    The seedlist ID can be any of the following:

    • library
    • library/site area
    • library/site area/sub-site area/...
    • JCRID of a site area

  5. If the content to be indexed is secured, go to the Security tab and enter the user name and password of the user that will be used to access the secured site. Then click Create on the search tab itself.

  6. If the site uses remote actions, filter these out of the search index. Go to the Filter tab:

    1. Type a name in the Rule Name field

    2. Select Apply rule while Collecting documents

    3. Select the rule type of Exclude

    4. Select the rule basis of URL text

    5. Type *&wcmAuthoringAction=* in the URL text field

    6. Click Create in the Filter tab

  7. Click Create.
If you have multiple parent site areas and want the searches to run across all site areas, we can create a content source for each of them in the same collection. If you don't want the searches to run across all parent site areas, create a separate collection for each parent site area or group of related parent site areas.


Search web content in a virtual portal

Search services and search collections are separate for individual virtual portals and are not shared between individual virtual portals. You set up an individual search service and separate search collections for each virtual portal. These collections can be used to crawl and search the same set of documents.

If we are using a website that is shared across virtual portals, then to search that website in a virtual portal environment :

  1. Create a new search collection for the virtual portal. We can create a new content source by copying the URL from the original search collection.

  2. Create a new search component, or copy an existing search component, and configure it to use the new virtual portal search collection created in step 1.

  3. Create a new search form, using an HTML component, configured to use the search component created in step 2.

  4. Create a new content item to display the HTML component created in step 3.

You must perform these steps for each virtual portal in your system.


Parent: Enable search for web content

Next topic: Configure Web Content Manager search options