@sharg3 can you confirm that you’ve first gone through the available documentation?
Externalization of services in your deployment
External Kafka in your deployment
External Elasticsearch in your deployment
External Hazelcast in your deployment
Third-party externalized services Deployment Changes FAQs
Monitoring an embedded stream service > Database
Understanding pr_data_stream tables > Nodes_
Externalisation of Kafka service
In Pega 8.8, if Hazelcast is down, it can impact the application’s performance and functionality. Features that rely on Hazelcast, such as distributed sessions, decisioning services, and others, may not function as expected. If the external Kafka nodes are down or there is corruption of replicas, it can disrupt the functioning of Queue Processors and Data Flows, as they rely on Kafka for message processing. This can lead to delays or failures in processing queued items. In both cases, Pega has implemented resilience measures to handle such scenarios. However, it’s important to monitor and maintain the health of these services to ensure optimal application performance.
In Pega 8.8, the pr_sys_statusnodes table is used to monitor the status of nodes in a Hazelcast cluster. If a node is in an unhealthy state and the application is recycled, the node should rejoin the same Hazelcast cluster, provided the network settings and cluster configuration have not changed. To validate stream nodes, you can use the pr_data_stream_nodes table which contains information about the Kafka cluster, including the list of all known Stream nodes, topics, data partition distribution, and the current controller node. For cluster restarts, it is recommended to follow the same process of restarting Stream nodes first, followed by Search and Web nodes. For monitoring Hazelcast and Kafka services, Pega provides built-in tools like PDC for system health monitoring. For on-premise clients, additional monitoring tools can be used based on the organization’s preference and infrastructure, such as Prometheus, Grafana, or any other tool that supports JMX monitoring.
This is a GenAI-powered tool. All generated answers require validation against the above provided references.
Please check internally for our HZ and Kafka experts if you need more help with this.
Please mark Accept Solution if you’re happy with the provided resources