I am new on this forum, and I can see there were plenty of post regarding problem with unicode and so on.
Before I posted this thread I have tried to find the right problem on this forum that would lead me to resolve my issue.
Unfortunately I didn't find so I write.
I am creating the site that user may post a text containing mix of different characters (i.e japanese). The pages I have done must be in UTF-8 encoding. Every comment sent by user is registered in db. The problem start when saving text in db. I want the oriental language to be saved in unicode (starting with &#...) but latin characters in its ASCII form (note: it must be registered in one field). When I change the page encoding onto ISO-8859-1 everything looks fine. In database my text is as I require. Latin in ASCII and oriental text in unicode. But when I change the page on UTF-8 the than everything is saved in latin + extended latin.
I have created function in php converting all characters to be unicode. But then I have a problem because latin text (say english) is saved in unicode as well.
On my pages I have applied expression (in php) that highlights the oriental words. So when expression meets certain characters (that create unicode - &#...
Please let me know I will be very appreciated.
