Configuring a crawler to search your local portal site
Configure and run a search crawler on your local portal site to
gather information and create a search collection that enables your users
to search your portal site.
Portal Search provides a default portal site search collection that
enables your users to search your portal site. Before your users can search
the portal site collection, perform the following tasks.
- Optional: Set the crawler user ID. If you want
to use a dedicated crawler user ID for crawling the portal site content source,
define and set that crawler user ID:
- Define the crawler user ID by using the Manage Users and Groups
portlet.
- Set the preferred language of the portal site crawler user ID
to match the language of the portal site search collection that it crawls.
(If you do this after you started a crawl on the portal site search collection,
you need to reset the portal site collection. Refer to Creating or resetting the portal site collection.)
- Edit the portal site collection content source and fill in the
crawler user ID and its password. To do this, proceed as follows:
- Click .
- In the search collection list, click the Portal Content search
collection.
- Click the Edit icon next to the Portal Content
Source collection name.
- Under the General Parameters tab, type the crawler
user ID into the appropriate field.
- Under the Security tab, type the crawler password into the appropriate
field.
- Click Save.
- Optional: Configure the crawler to follow external
links. If you want the crawler to follow external links from inside the Portal,
you can modify the value in the Levels of links to follow field
under the General Parameters tab. Set the level to a value higher than 1.
In addition, you can configure filters for those external links from the Filters
tab. The default filter suppresses any links that point back to Portal pages.
The default filter is displayed only after saving the configuration of the
content source.
- Start the initial crawl. Start the initial crawl on the
portal site content source:
- Click .
- In the search collection list, click the Portal Content search
collection.
- Click the Start Crawler icon (right-pointing
arrow) next to the Portal content source name.
- Configure regular crawls. If you want regular crawls on
the portal site content source, perform either of the following tasks:
- Enable the default scheduler. To do this, proceed as follows:
- Click the
View Content Source Schedulers
icon next to the collection name.
- In the Manage Schedulers page, click Disabled.
This changes the status of the scheduler to Enabled and displays a confirmation
message.
- Set up your own scheduler. To do this, proceed as follows:
- Click the Edit icon for the content source.
- Select the Schedulers tab.
- Configure your own scheduler as required. For more details about how to
do this, refer to the Manage Search portlet help.
For more detailed information about how to work with content sources
refer to Managing the content sources of a search collection and to the Manage Search
portlet help.Notes:
- Only the main panels of the portlets on the portal pages are indexed and
can be searched. The crawler does not follow links that are specified within
a portlet.
- By default, items in the result lists from portal site searches provide
no summary information. If end users are using the Search and Browse portlet
they can refer to the information given under Description: for information
about the search result list item. If you want to have the summary information
added, configure the portlet with the summary parameter enabled as follows:
PortalCollectionSummarizer=on.
- When you crawl a portal site, be aware of the Memory required for crawls and
the Time required for crawls and imports and availability of documents.
- Set the preferred language of the crawler user ID to match the language
of the search collection that it crawls.
- The portal site search collection is created when an administrator navigates
to the Manage Search portlet. However, start the crawl for users
to be able to search the portal site. Depending on your portal configuration
and environment and possible customization, you might need to reset the portal
site search collection that was created. For details about such scenarios
and the necessary tasks to perform refer to Creating or resetting the portal site collection.
- If your users search the portal site search collection on a secured portal
site, refer to the additional information under Enabling search on a secured portal site with the default configuration.
When users search a portal site, they can access portal pages
of two types:
If you customize search on your portal site, you might find useful
information under Configuring the default location for search collections and Creating or resetting the portal site collection.
If your portal site is multilingual
and your users use different languages to search your portal, refer to Crawling a multilingual portal site.
Parent topic: Searching your local portal site
Related tasks
Crawling a multilingual portal site
Configuring search on a secured portal site
Creating or resetting the portal site collection
|
|
|