Portal, Express Beta Version 6.1
Operating systems: i5/OS, Linux,Windows |
At this time the Content Model Search Service has only one search collection. This search collection is provided with the installation by default. You cannot modify this default Content Model search collection or create additional search collections under the Content Model Search Service. The content model search service is listed because you can include it in scopes.
If the preferred language of the crawler user ID does not match the language of the search collection, users might not see search results in their language. Therefore, set the preferred language of the portal site crawler user ID to match the language of the portal site search collection that it crawls. If you do this after you started a crawl on the portal site search collection, you need to reset the portal site collection. Refer to Creating or resetting the portal site collection.
If your portal site is multilingual and your users use different languages to search WebSphere Portal Express, set the portal site collections up as described under Crawling a multilingual portal site.
In order to use external search services such as Google and Yahoo! with an English search keyword, a URL such as the sample URL mentioned in the Search Center portlet help for configuring the portlet works fine as is: http://www.google.com/search?q= . However, if you search in other languages, consult the documentation of the remote search service that you use to ensure that the Web interface is set up and used appropriately for the language that you use for your search. This can avoid problems with the displayed results, depending on the combination of languages set for WebSphere Portal Express, your browser, and the search.
key boardis indexed as two separate words key and board .
key- boardis indexed as one word keyboard.
The following search administration tasks can require extended periods of time:
These tasks are put in a queue. It might therefore take several minutes until they are executed and the respective time counters start, for example, the crawl Run time and the timeout for the crawl set by the option Stop collecting after (minutes): . The time required for these tasks is further influenced by the following factors:
Therefore both the time limits that you can specify and the times that are shown for these processes work as fuzzy time limits. This applies, for example, to the following scenarios:
Furthermore, this influences other status indicators given in the Manage Search portlet. For example, the number of documents shown for a content source can show with an unexpectedly low figure or even at zero ( 0 ) until the crawl on that content source has been completed.
Crawling can require large amounts of memory. This depends on your Portal Search environment. Therefore, before you start a crawl, make sure that WebSphere Portal Express has enough free memory. Memory shortage can cause a corrupted search collection and eventually lead to a system freeze.
To resolve this problem, raise the limit to the number of open files by using the ulimit command as root administrator.
EJPJP0009E: Wrong root url for Portal site crawler: https://root_urlYou can ignore this message. The crawl runs correctly.
To resolve this problem, edit the content source, select the General Parameters tab, and the set the parameter Stop fetching documents after (seconds): to a value of 90 seconds.
When you uninstall WebSphere Portal Express, the directories and files for the search collections are not deleted. Therefore, before you uninstall WebSphere Portal Express, delete all search collections by selecting the collections individually and clicking the option Delete Collection. If you do not do this, these files and directories remain on the hard drive. If you want to delete the search collection data after uninstalling WebSphere Portal Express, you need to do this manually. The directory path of a search collection is determined by what you typed in the field Location of Collection when you created the search collection. You can look up the collection location by performing the following steps:
The HTTP crawler of the Portal Search Service does not support JavaScript. Therefore some text of Web documents might not be accessible for search by users. This depends on how the text is prepared for presentation in the browser. Specifically text that is generated by JavaScript might or might not be available for search.
ulimit -n 4096
Problem: If the file path length for the location of search collections exceeds its limit, the collection cannot be created. This can occur particularly when the portal site collection is created under Linux operating systems.
Cause: The file path length for the portal search collection is limited to 118 characters. If this limit is exceeded, the default collection cannot be created. The following items contribute to the length of the file path:
Problem: A Search and Browse portlet cannot access the search collection to which you configured it.
Cause: If you migrated from a previous version of WebSphere Portal Express, the parameter for specifying the target search collection has been changed in the configuration for the Search and Browse portlet. The parameter IndexName has been replaced by CollectionLocation.
Solution: If you migrate from previous versions and have the Search and Browse portlet deployed, transfer the value from the old to the new parameter manually. For details about this refer to Migrating the Search and Browse portlet from version 5.x to 6.x.
Problem: If a cluster member in a cluster fails, users who were using the affected cluster member when the failover occurred can no longer access search collections. This can occur with horizontal scaling when a node fails or with vertical scaling when a particular cluster member fails.
Solution: Users who are logged into the cluster member that failed must log out of WebSphere Portal Express and then log back in before they will be able to access search collections again.
Search can return documents based on metadata of these documents, not just on words found in the fields or actual text of the document. It might appear to Portal Search users that their searches return documents which do not appear to match the search criteria.
Cause: Meta-data for documents is also indexed for search. Therefore if the meta-data of documents matched the search criteria, these documents are also returned as results for the search.
Solution: This works as designed and is usually considered to be of benefit.
If you delete a content source, then the documents that were collected from this content source will remain available for search by users under all scopes which included the content source before it was deleted.
Cause: These documents will be available until their expiration time ends.
Solution: The expiration time can be specified under Links expire after (days): under General Parameters when you created the content source.
The Portal Search portlets cannot be provided as WSRP services, as some additional and more advanced WebSphere Portal Express concepts and features are not reflected by the current WSRP standard yet. This includes the Portal Search portlets Manage Search, Taxonomy Manager, Search and Browse, and the Search Center portlets.
Parent topic: Portal Search