ISPN000299: Unable to acquire lock after 15 seconds for key SessionCreationMetaDataKey

SayoniM1 · April 24, 2023, 7:02pm

This is related to incident INC-268701.

We recently upgraded our application from pega v7.2 to pega v8.6.5 . After we migrated to prod, for 2 days everything worked fine after setting up the prod environment. However from the 3rd day onwards, we started seeing a lot of slowness, as well as the following error, on logging in with administrator access :
Org.infinispan.util.concurrent.TimeoutException : ISPN000299: Unable to acquire lock after 15 seconds for key SessionCreationMetaDataKey(NCb-6AkmthbIwwoZpSMc8kqoeRwtJ9G-JtQ5hYWu) and requestor GlobalTx:Pega-MX-slave2-mxcyvlpras2005:pega-mx-server-one:2036. Lock is held by GlobalTx:Pega-MX-slave2-mxcyvlpras2005:pega-mx-server-one:2034

We’re seeing a lot of deadlocks related to the job scheduler PZPURGEPRSYSSTATUSNODES in the logs :

ERROR    - [PersistentJobExecutionFactory] Job[pzPurgePRSysStatusNodes] execution lock has failed. 
com.pega.pegarules.pub.database.LockFailureException: Exception occurred while retrieving existing lock PZPURGEPRSYSSTATUSNODES: code: <none> SQLState: Problem executing lock check: code: 1205 SQLState: 40001 Message: Transaction (Process ID 83) was deadlocked on lock resources with another process and has been chosen as the deadlock victim. Rerun the transaction. 
DatabaseException caused by prior exception: com.microsoft.sqlserver.jdbc.SQLServerException: Transaction (Process ID 83) was deadlocked on lock resources with another process and has been chosen as the deadlock victim. Rerun the transaction. 
 | SQL Code: 1205 | SQL State: 40001

We have also noticed the below observations :

This slowness is only observed for dev/ops/admin users, i.e. while accessing the dev studio/admin studio/app studio. Our branch users who access the applications are fine. Hence the impact is more for the ops team who needs to access the admin studio, and this would also impact importing packages during deployments.
Somehow all the 6 nodes got configured as STREAM nodes, and the job scheduler PZPURGEPRSYSSTATUSNODES runs on all nodes.

Has anyone else faced similar issues? What is the impact if all nodes are configured as stream nodes?

SuryaYanamandra · October 26, 2023, 12:28pm

Hi @SayoniM1 Please let me know whether you got any fix for the above issue.

We are also facing the same, Please share any idea regarding the same

Regards,

Surya

MarijeSchillern · October 26, 2023, 1:17pm

@SayoniM1 @SuryaYanamandra ticket INC-268701 logged against Pega 8.6.5 was closed in June with the following note:

"
For the connection parameters that are added to the URL for the MSSQL database there are a few important parameters, namely “selectMethod=cursor” and “sendStringParametersAsUnicode=false” that can cause some significant issues if not present.

In addition, we recommend that the JDBC transaction isolation level be set to “READ_COMMITTED_SNAPSHOT” in order to avoid some potential deadlocks.

Read committed snapshot is set at the database level, and is a setting that would need to be configured by sp_configure.

Letting the job scheduler run on a single node should be fine, it is not attempting to run on more than one node.

The user confirmed that decommissioning of a node and changing the node type in test environment worked fine"

Please post any new questions relating to this type of issue as a New Question.

Conversation		Replies	Views
Requestor Lock Exception General system-administration , case-management , other-industry , 8-1-4	4	3560	March 2, 2021
ORA-00060: deadlock detected while waiting for resource Blueprint and App Design system-architect , performance , enterprise-application-development , financial-services , 7-4	3	258	October 19, 2022
Pega Lock mechanism General case-management , 8-4-1	17	13559	March 22, 2024
Job execution lock has failed. PersistentJobExecutionFactory. ORA-00001. DuplicateKeyException General pega-platform , senior-system-architect , system-architect , case-management , communications-and-media , 8-8-3	3	266	November 17, 2023
Transaction (Process ID ) was deadlocked on lock \| communication buffer resources with another process General senior-system-architect , other-industry , 8-8-5 , dev-designer-studio	6	529	October 16, 2024

ISPN000299: Unable to acquire lock after 15 seconds for key SessionCreationMetaDataKey

Related topics