Charset nightmare

Posted: Wed Jun 28, 2006 9:21 am
by neophyte
before the caller could finish his question � and complete his

Do you see the funny character after the word "question"? I've got a lot of those non-UTF-8 characters in some data. They're curly quotes (single and double) and a few others. How can I clean them out of my data?

Posted: Wed Jun 28, 2006 9:51 am
by feyd
Convert them to HTML entity form?

Posted: Wed Jun 28, 2006 10:08 am
by neophyte
I converted them with the all_ascii function at the bottom of this page:

http://www.htmlforums.com/archive/index ... 71603.html

Posted: Wed Jun 28, 2006 10:22 am
by feyd
The curly quotes aren't in the standard ASCII character set, so they will need to be converted either to HTML entities or to their UTF-8 encodings.
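
For anyone reading this later, here is a minimal sketch of the entity route. It is written in Python rather than PHP, and it is not the all_ascii function mentioned above. It assumes the text has already been decoded into a Unicode string; the function name to_entities is made up for illustration.

```python
def to_entities(text: str) -> str:
    """Replace every non-ASCII character with a numeric HTML entity.

    Assumption: `text` is already a decoded Unicode string, not raw bytes.
    ASCII characters pass through unchanged.
    """
    # "xmlcharrefreplace" turns each unencodable character into &#NNNN;
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


if __name__ == "__main__":
    sample = "\u201cquoted\u201d and it\u2019s"
    print(to_entities(sample))
```

The result is pure ASCII, so it survives any charset the page is served in, at the cost of being harder to read in the raw data.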

Posted: Wed Jun 28, 2006 10:31 am
by neophyte
I'm actually working on porting some simple PHP blog content into Drupal. I've got it working now. I converted the non-UTF-8 characters to UTF-8 characters. It's looking good now in Drupal.
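
The post doesn't say how the conversion was done, so the following is only a sketch of one common approach, again in Python. It assumes the old data is Windows-1252 (cp1252), which is a guess based on the curly quotes and is not stated in the thread. The function name to_utf8 and the legacy_encoding parameter are hypothetical.

```python
def to_utf8(raw: bytes, legacy_encoding: str = "cp1252") -> bytes:
    """Return `raw` re-encoded as UTF-8.

    Assumption: anything that is not valid UTF-8 is in `legacy_encoding`
    (cp1252 by default; this is a guess about the source data).
    """
    try:
        raw.decode("utf-8")
        return raw  # already valid UTF-8, leave it untouched
    except UnicodeDecodeError:
        # Bytes that cp1252 leaves undefined become U+FFFD instead of raising.
        text = raw.decode(legacy_encoding, errors="replace")
        return text.encode("utf-8")


if __name__ == "__main__":
    legacy = b"finish his question \x92 and complete his"
    print(to_utf8(legacy).decode("utf-8"))
```

Checking for valid UTF-8 first avoids double-encoding rows that were already clean, which is a common way to end up with more garbage than you started with.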