Search and crawling portal and other sites
Configure your local portal site, and crawl remote portal sites, so that they are searchable by users. Run crawlers against other, external Web sites to make them searchable by local portal users. Users of the portal can search across various types of sites.
In addition to searching the local portal site, you can crawl remote portal sites, and external Web sites, to make search results from those sites available to your local portal users. Examples of search scenarios include:
- Users of the portal search your own local portal site. This can include public and secure pages of the portal.
- Users of the portal site search other portal sites. This works only for public pages of the other portals.
- Users of the portal search external Web sites such as yahoo.com or google.com or cnn.com. When you run a crawler against external Web sites, you can collect and display external search results next to results from your local portal site.
- External users search the portal site. This works only for public pages of the portal.
- Search your local portal site
View information on setting up your local portal site for your users to search.
- Crawl a remote portal site
Configure Portal Search to crawl and index a remote, public portal site.
- Crawl an external site using a seedlist
The seedlist crawler is a special HTTP crawler that can be used to crawl external sites which publish their content using the seedlist format.
The seedlist format is an ATOM/XML-based format specifically for publishing application content, including all its metadata. The format supports publishing only updated content between crawling sessions for more effective crawling.
You can configure the seedlist crawler with general parameters, filters and schedulers, then run the crawler.
Parent topic:
Portal Search