Rich Text Editor - Text Extraction in RTL Language

We are trying to store Arabic text entered through a Rich Text Editor (RTE).

On the clipboard, the stored value contains the text together with HTML markup.

When we extract the plain text from this value, some lines of RTL text switch to LTR.

Using replaceviaRegex would require us to list every HTML tag the RTE can produce, and even that does not solve our issue completely.

What other ways are there to extract the text?

We are currently trying out Java libraries that can do this. Is there any way to do it using Pega OOTB rules?

@RahulKakunuru could you give us the version of Pega you are using?

Could you illustrate how you are using RTE (and the clipboard equivalent) and where the HTML text is embedded within these controls?

Is there any reason to use the Rich Text Editor if you want to save the value as plain text?

A Text Area control would be a better fit for that purpose.

Have you configured RTL as per the configuration documentation?

I found the following forum posts. Do these help explain why the HTML tags are being displayed in the first place?

Convert Plain text to HTML (RTE)

retrieving the content without html tags

HTML tags are displayed in RTE - Instead should be displayed as formatted text

How to convert text entered in rich text editor control to plain text.

The solution is explained in the following support articles:

Unable to recognize HTML tags in the pyNote property

Rich Text Editor adds extra paragraph tags

Please try out the above suggestions and confirm if these provide a solution.

@MarijeSchillern I see that you originally logged a support incident, and within that ticket you were informed that there is no OOTB method to carry out this procedure.

As this is now a design question, we will let forum users comment in case they have suggestions on how to meet your UI needs.

Thanks for the suggestions.

We achieved this using the jsoup library, and it worked well.

@RahulKakunuru
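
The code used was not posted in this thread. With jsoup, the usual call for this kind of extraction is Jsoup.parse(html).text(). For readers who cannot add a library, the same idea (let an HTML parser walk the markup instead of listing tags in a regex) can be sketched with the HTML parser that ships with the JDK. This is a minimal sketch under assumptions, not the poster's implementation: the class name RteTextExtractor and the method toPlainText are hypothetical, and the sample markup is only a guess at what an RTE might emit.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper: strips markup from RTE output using the JDK's HTML parser.
final class RteTextExtractor {

    // Returns the text content of an HTML fragment, with a line break
    // after each block-level element and at each <br>.
    static String toPlainText(String html) {
        StringBuilder out = new StringBuilder();

        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleText(char[] data, int pos) {
                // Characters stay in logical (typing) order; nothing is reversed here.
                out.append(data);
            }

            @Override
            public void handleEndTag(HTML.Tag tag, int pos) {
                if (tag.isBlock()) {
                    newline();
                }
            }

            @Override
            public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.BR) {
                    newline();
                }
            }

            // Adds a line break unless the output is empty or already ends with one.
            private void newline() {
                int len = out.length();
                if (len > 0 && out.charAt(len - 1) != '\n') {
                    out.append('\n');
                }
            }
        };

        try {
            new ParserDelegator().parse(new StringReader(html), callback, true);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toString().trim();
    }

    public static void main(String[] args) {
        // Assumed sample: two right-to-left paragraphs.
        String html = "<p dir=\"rtl\">\u0645\u0631\u062D\u0628\u0627</p>"
                    + "<p dir=\"rtl\">\u0628\u0627\u0644\u0639\u0627\u0644\u0645</p>";
        System.out.println(toPlainText(html));
    }
}
```

Note that removing the tags also removes any dir="rtl" attributes, so how the resulting plain text is laid out depends on the control that later displays it.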