+

Search Tips   |   Advanced Search

Search and crawl portal and other sites

Configure the local portal site, and crawl remote portal sites, so they are searchable by users. Run crawlers against other, external Web sites to make them searchable by local portal users.

Users of the portal can search across various types of sites. In addition to searching the local portal site, we can crawl remote portal sites, and external Web sites, to make search results from those sites available to the local portal users. Examples of search scenarios include:

  • Users of the portal search our own local portal site. This can include public and secure pages of the portal.

  • Users of the portal search the WCM collection provided with the portal. This includes all Web Content Manager sites and libraries,

  • Users of the portal site search other portal sites. This works only for public pages of the other portals.

  • Users of the portal search external Web sites such as yahoo.com or google.com or cnn.com. When we run a crawler against external Web sites, we can collect and display external search results next to results from the local portal site.

  • External users search the portal site. This works only for public pages of the portal.

  • Reset the default search collection
    Under certain circumstances, we might want to change the configuration of the portal site search collection. In this case, we must re-create the collection, as search collections cannot be modified.

  • Crawl a remote portal site
    Configure Portal Search to crawl and index a remote, public portal site.

  • Crawl an external site using a seedlist provider
    The seedlist crawler is a special HTTP crawler used to crawl external sites which publish their content using the seedlist format. The seedlist format is an ATOM/XML-based format specifically for publishing application content, including all its metadata. The format supports publishing only updated content between crawling sessions for more effective crawling. We configure the seedlist crawler with general parameters, filters and schedulers, then run the crawler.


Parent Administer Portal Search