Cassandra Optimization and Upgrade Recommendations

I’ve noticed that our Cassandra nodes in the CDH-based Pega application experience significant load fluctuations.

Each node’s data load reaches approximately 75 GB, but after running garbage collection and a full repair, the load drops to 41 GB. This behavior suggests there might be excessive temporary data accumulation or inefficient data management.

Could you provide insights into optimizing data storage and garbage collection processes in this context, and whether this indicates any underlying issues with our current configuration or data models?

Can PDC be helpful in any manner to understand this behavior?

Additionally, we are planning to upgrade to Pega 24.1.1 / CDH 24.1. Could you please suggest the stable/recommended 4.x version of Cassandra to use? Currently, we are using Cassandra ReleaseVersion 3.11.3. Any relevant documentation would also be appreciated.

@RavikiranN7109 please use the Pega Documentation server and the PSC Keyword search capabilities in order to find this type of information:

1.To optimize your data storage it is essential to review your database metrics using Pega Diagnostic Center (PDC). PDC allows you to analyze database-specific metrics including storage consumption and usage trends. If your database space is continually trending upward it may indicate underlying issues with your current configuration or data models such as improper storage of case attachments. Implementing a case archiving policy can also help manage storage effectively. PDC provides insights into database performance helping you diagnose issues like excessive data in frequently accessed tables and missing indexes. By leveraging PDC you can monitor and optimize your data storage and understand the behavior of your database more effectively.

:warning: This is a GenAI-powered tool. All generated answers require validation against the provided references.

Managing your cloud data storage effectively

Optimizing PostgreSQL Database in Pega Cloud

  1. For Pega 24.1 it is recommended to use Cassandra 4.0.x or 4.1.x as Pega no longer supports Cassandra 3.x versions. You can find more information in the Platform Support Guide which details the supported versions of externalized Cassandra databases for Pega 24.1.

:warning: This is a GenAI-powered tool. All generated answers require validation against the provided references.

Support for Datastax Enterprise with Pega 24.1

Platform Support Guide