How to create IH Summary Dataset with source as stream dataset

Hi,

We are moving our entire IH DB data to File repository, upon reading the pdn articles found that when we enable exporting to repository, IH data will start pushing in to pxResponsesStream and from there it will push to repository through data flow.

However our issues are. we want to migrate existing IH table data.

  1. If we manually prepare the JSON structure files and push in to file repository, will that work when we run IH summaries.

  2. When trying to IH Summary dataset, not able to give source as pxResponsesStream. Not sure why its not showing in dropdown. we need to use pxResponsesStream only to make use of File Repositories right.

  3. Does IH summaries calculate on pxReponsesStream dataset only, if yes, how does this dataset retrieves data from file repository.

Struck in proceeding further. Can someone help if you have idea on how exactly file repository things works.

Note :- We are working on Pega Infinity 23

@satishkumarreddym To migrate existing interaction history data, you need to prepare for a transition period during which the system saves interactions both to the database and to the InteractionFiles data set. The system will not recognize manually prepared JSON structure files in the file repository when running interaction history summaries. The pxResponsesStream data set is configured as the source for the export data flow. If it’s not showing in the dropdown, there might be a configuration issue.

The InteractionFiles data set, also known as the replay data set, is used to update interaction history summaries with the latest interactions. This occurs whenever you create a new summary or update an existing one. The system retrieves data from the file repository using the WriteDataToInteractionFiles data flow. (SB)

More info:

  1. Manually preparing JSON structure files and pushing them to the file repository will not work directly for running IH summaries. The system uses a specific Data Flow to export interactions to files in the repository, and these files are then used for analysis in business intelligence (BI) tools. The Data Flow runs create files in the repository every 900 seconds or after processing 75,000 records.

  2. If you are unable to select pxResponsesStream as the source for the IH Summary dataset, it might be due to configuration issues. Ensure that the pxResponsesStream Data Set is properly configured and available in your environment. Systems installed starting with Pega 8.8 use pxResponsesStream by default for IH aggregation.

  3. IH summaries do calculate on the pxResponsesStream dataset. During the export process, interactions are written to the pxResponsesStream, and from there, they are pushed to the file repository through the Data Flow. The system recalculates all aggregates within the Summary Data Sets from IH records exported to the File Repository. (SC)

:warning: This is a GenAI-powered tool. All generated answers require validation against the provided references.

Managing the export of interactions > Supporting artifacts for exporting interactions

Migrating interaction history from the relational database to a file repository

Preparing to migrate interactions

Managing data export

Interaction History Summaries and backfill process information