Data flow execution failed while the thread was initializing:

Campaign run failed without failed records.

Findings:

1.error message:

com.pega.pegarules.pub.context.PRStackFrameNotTopException: Specified stack frame '' does not match top of stack: 'Execute Offer'
	at com.pega.pegarules.session.internal.mgmt.Executable.popStackFrame(Executable.java:2128)
	at com.pega.decision.dsm.strategy.clipboard.DSMPegaAPI.popStackFrame(DSMPegaAPI.java:1590)
	at com.pega.dsm.dnode.impl.dataflow.task.strategy.ExecutionStrategy.executeDataFlow(ExecutionStrategy.java:135)
	at com.pega.dsm.dnode.impl.dataflow.task.strategy.SinglePartitionExecution.executePartitions(SinglePartitionExecution.java:77)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable.executeDataFlowWithPartitions(ExecutePartitionsRunnable.java:86)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable.access$900(ExecutePartitionsRunnable.java:34)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable$1.run(ExecutePartitionsRunnable.java:67)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable$1.run(ExecutePartitionsRunnable.java:62)
	at com.pega.dsm.dnode.util.PrpcRunnable.execute(PrpcRunnable.java:67)
	at com.pega.dsm.dnode.impl.prpc.service.ServiceHelper.executeInPrpcContextInternal(ServiceHelper.java:301)
	at com.pega.dsm.dnode.impl.prpc.service.ServiceHelper.executeInPrpcContext(ServiceHelper.java:146)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable.call(ExecutePartitionsRunnable.java:62)
	at com.pega.dsm.dnode.impl.dataflow.service.run.ExecutePartitionsRunnable.call(ExecutePartitionsRunnable.java:34)
	at com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:108)
	at com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:41)
	at com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:77)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
	at com.pega.dsm.dnode.util.PrpcRunnable$1.run(PrpcRunnable.java:59)
	at com.pega.dsm.dnode.util.PrpcRunnable$1.run(PrpcRunnable.java:56)
	at com.pega.dsm.dnode.util.PrpcRunnable.execute(PrpcRunnable.java:67)
	at com.pega.dsm.dnode.impl.prpc.PrpcThreadFactory$PrpcThread.run(PrpcThreadFactory.java:124)

2.Found some fail partition in Dataflow → Distribution details → Partitions

Any one have met the issue before and know the root cause? Thanks in advance!!

@JimZ17051627

:warning: This is a GenAI-powered tool. All generated answers require validation against the provided references.

The error message indicates a stack frame mismatch where the specified stack frame (which appears empty) does not match the top of stack ‘Execute Offer’. This typically suggests an issue with the execution context during the dataflow operation, particularly when the system is trying to manage stack frames during the execution of strategies.

Based on the stack trace and the fact that you found failing partitions in the Dataflow distribution details, here are some possible causes and troubleshooting steps you might want to consider:

Potential Causes:

  1. Stack Frame Management Issues: There appears to be an inconsistency in how stack frames are being managed during the execution of your dataflow, particularly around the ‘Execute Offer’ context.
  2. Execution Strategy Context Problems: The error occurs in ExecutionStrategy.executeDataFlow and involves SinglePartitionExecution.executePartitions, suggesting potential issues with how the execution strategy is handling partition execution.
  3. Thread Initialization Conflicts: As your subject line mentions, the thread initialization process seems to be problematic, which could indicate resource contention or improper thread setup in the dataflow service.
  4. Partitioning Configuration: The mention of failing partitions suggests possible issues with how your dataflow partitions are configured or distributed.

Troubleshooting Steps:

  1. Review Dataflow Configuration:
    • Check the dataflow’s shape and components, particularly any components that might involve the “Execute Offer” process
    • Verify that all components are properly configured and compatible
  2. Examine Partition Settings:
    • Look at the details of the failing partitions you identified
    • Check for any unusual patterns in the failing partitions (size, data characteristics, etc.)
  3. Check for Resource Constraints:
    • Monitor system resources during execution to see if resource limitations might be causing thread initialization issues
    • Verify that your dataflow service nodes have sufficient resources allocated
  4. Review Error Threshold Settings:
    • Check if your dataflow might be hitting error thresholds that are triggering failures
  5. Inspect Related Rules and Activities:
    • If your dataflow involves custom activities or transformations, review these for potential issues
    • Pay particular attention to any components that might manipulate the execution context or stack
  6. Review Recent Changes:
    • Consider any recent changes to your application, dataflow configuration, or infrastructure that might have introduced this issue

If these steps don’t lead to a resolution, I would recommend opening a Support case to get assistance from Pega Support. lease provide the INC id once you’ve logged it via the MSP

[References: