Searching and crawling portal and other sites
Configure your local portal site, and crawl remote portal sites,
so that they are searchable by users. Run crawlers against other, external
Web sites to make them searchable by local portal users.
Users of your portal can search across various types of sites. In addition
to searching the local portal site, you can crawl remote portal sites, and
external Web sites, to make search results from those sites available to your
local portal users. Examples of search scenarios include:
- Users of your portal search your own local portal site. This can include
public and secure pages of your portal.
- Users of your portal site search other portal sites. This works only for
public pages of the other portals.
- Users of your portal search external Web sites such as yahoo.com or google.com
or w3.ibm.com. When you run a crawler against external Web sites, you can
collect and display external search results next to results from
your local portal site.
- External users search your portal site. This works only for public pages
of your portal.
- Searching your local portal site
These topics describe how to set up your local portal site for your users to search.
- Crawling a remote portal site
Configure Portal Search to crawl and index a remote, public portal site.
- Crawling an external site using a seedlist
The seedlist crawler is a special HTTP crawler that can be used to crawl external sites which publish their content using the seedlist format. The seedlist format is an ATOM/XML-based format specifically for publishing application content, including all its metadata. The format supports publishing only updated content between crawling sessions for more effective crawling. You can configure the seedlist crawler with general parameters, filters and schedulers, then run the crawler.
Parent topic: Portal Search
|
|
|