The Stream service is stopping regularly. When we compared timestamps between servers, we found the following exceptions in the Kafka server log file. The complete log file is attached.
Kafka seems to be trying to clean up the log file but is unable to do so, and then the Stream service stops.
Once the stream node is stopped, we get many exceptions on all nodes.
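To line up errors across servers by timestamp, a small script can pull the error entries out of each Kafka server log. This is only a minimal sketch: it assumes each log entry starts with a bracketed timestamp followed by the log level, which may not match your logging configuration, and the name error_entries is made up for illustration.

```python
import re
from typing import Iterable, List, Tuple

# Assumed entry layout: "[<timestamp>] <LEVEL> <message>".
# Adjust the pattern if your Kafka logging configuration differs.
ENTRY = re.compile(r"^\[(?P<ts>[^\]]+)\]\s+(?P<level>[A-Z]+)\s+(?P<msg>.*)$")


def error_entries(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Return (timestamp, message) for every ERROR or FATAL entry."""
    found = []
    for line in lines:
        match = ENTRY.match(line.rstrip("\n"))
        # Stack-trace continuation lines do not match and are skipped.
        if match and match.group("level") in ("ERROR", "FATAL"):
            found.append((match.group("ts"), match.group("msg")))
    return found
```

Run it over the log file from each server (for example, error_entries(open(path, encoding="utf-8")), where path is wherever your log lives) and compare the timestamps side by side.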
I found a Pega article related to the same problem, but it applies to Pega Platform™ version 8.5 through Pega Platform version 8.7. We are on Pega 8.8.1.
Note that this is not a Pega issue but a Kafka limitation on Windows.
The changes shown in the article should prevent this issue. Also note that if the kafka-data folder is deleted from the server for any reason, you have to perform the above changes again.
If the above resolves your issue, I will contact the author of the Support Document to ask that Pega 8.8 be added to the ‘applies to’ list.
@Brahmesh@ Based on the log you have shared and the error messages, it appears the Stream services were not stopped gracefully.
When Stream services are not externalized and have had an improper shutdown, they start behaving differently. Usually Kafka tries to create a topic, but that topic already exists or is running on another stream node. Because the shutdown was ungraceful, there is no control communication between the members of the Kafka cluster.
Review the following steps to troubleshoot the problem.
– Stream Nodes Issue —
Shut down the stream nodes [make sure both Kafka and the Pega JVM are stopped].
Stream nodes should be stopped one at a time.
The stop sequence is the exact opposite of the start sequence.
At least one stream node should always be up and running for Pega before any other node type can come up.
Web user nodes can always be started last [just a best practice]. The order in which they are stopped doesn't matter.
Search nodes should always be started one at a time. Typically the primary search node should start first, followed by the secondary search node. The same order can be followed when stopping.
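The general rule above (stop in the exact opposite order of start) can be sketched as below. The node names are hypothetical placeholders, and the sketch ignores the exceptions noted for web user nodes and search nodes; it only illustrates deriving the stop sequence from the start sequence.

```python
from typing import List


def stop_order(start_order: List[str]) -> List[str]:
    """Return the shutdown sequence: the exact reverse of the start sequence."""
    return list(reversed(start_order))


# Hypothetical node names, for illustration only.
# A stream node comes first because one must be up before other node types.
start = ["stream-1", "stream-2", "search-primary", "search-secondary", "web-user-1"]

for node in stop_order(start):
    # Stop one node at a time and confirm it is fully down before the next.
    print("stop", node)
```

Keeping the start sequence in one list and deriving the stop sequence from it avoids the two drifting apart when nodes are added.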
@Brahmesh@ This error is indicative of corrupted data in the kafka-data folder, which needs to be cleared. If the problem persists even after clearing kafka-data and restarting the JVMs, additional debugging is required.
@MarijeSchillern we had raised the ticket INC-A17719 for it. It was solved after executing the script for the cleanup policy mentioned in the article.