I am working with an xml document that has html tags embedded in it. The character entities that are embedded in the html need to be converted to tags:
For example, the following character entities are part of the html:
’
—
I need to perform these conversions Convert '’' to '’'
Convert '—' to '—'
Convert ' ' to ' '
I have successfully converted these 3 character entities in the DTD by adding entity declarations. For example:
<!ENTITY rsquo "’">
I will be receiving these xml documents frequently, and the html text changes, so there will probably be additional character entities (beyond the 3 above) that require conversion.
Is there another way to perform these character entity conversions without having to add an entity declaration in the DTD for each one?