Activity stream search service

Activity stream search service

The activity stream search service, bundled with the News application, provides indexing and search infrastructure.
The service is automatically configured to crawl the activity stream seedlist every 30 seconds. After the initial full crawl of the activity stream, subsequent crawls are incremental, and only new events generated since the previous crawl are collected.
When you install Connections, the crawler is disabled by default.
Crawl and indexing runs on a cluster member where the News application is deployed, chosen automatically by the WebSphere High Availability (HA) Manager. If News becomes unavailable on this server, a different server running News is chosen to replace it. For each crawling session, the indexing server creates a delta index in a shared file system, and sends a notification to other nodes in the cluster. This delta index is read from shared file system by the other nodes and merged into the main index on the local disk. All the cluster nodes serve search requests by reading from the local index. Configuration and status information for the crawlers is stored in database tables available to all the nodes. Delta indexes are stored for 24 hours. If a node is down for more than 24 hours, copy the index manually to that node from another node.
We manage the activity stream search service from a browser user interface, the Activity Stream Search Administration page. We can enable or disable the crawler, edit the crawler schedule, clear the current indexed content, and perform a full crawl.

Administer activity stream search
Copy the activity stream search index to new nodes
Configure activity stream search index settings
Restore the activity stream search index

Parent topic:
Administer the News repository

Related:
Administer Search
Roles