Finding characters

matthijs
DevNet Master
Posts: 3360
Joined: Thu Oct 06, 2005 3:57 pm

Finding characters

Post by matthijs »

How do I find out which characters these are:

â€"
… 
….
There's probably some table or online converter tool somewhere? Or some real geek out here who recognizes these? :)
My own guess, since they were found on some website, is that they might have been accents or quotes.
Apollo
Forum Regular
Posts: 794
Joined: Wed Apr 30, 2008 2:34 am

Re: Finding characters

Post by Apollo »

Alas, they don't mean anything without knowing what encoding you use :(

However, they look like incorrectly encoded UTF-8 characters. The first is incorrect (it's not valid UTF-8), but the byte sequence 0xE2 0x80 0xA6 (2nd line) is the "three dot" character (that is "…", one char!) encoded in UTF-8. The 3rd line is the same plus an extra normal dot, so that would be "…."
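
As a rough check of the byte sequence above, here is a minimal Python sketch. It assumes the garbled display came from a Windows-1252 (cp1252) viewer, which is only a guess; the variable names are made up for illustration.

```python
import unicodedata

# The three bytes mentioned above.
raw = bytes([0xE2, 0x80, 0xA6])

# Read as UTF-8, they form a single character.
as_utf8 = raw.decode("utf-8")

# Read as Windows-1252 (an assumption about the viewer), each byte
# becomes its own character, which gives the garbled three-character form.
as_cp1252 = raw.decode("cp1252")

print(f"U+{ord(as_utf8):04X}", unicodedata.name(as_utf8))
print([f"U+{ord(ch):04X}" for ch in as_cp1252])
```

The first print reports U+2026 HORIZONTAL ELLIPSIS; the second lists the code points of "â", "€" and "¦".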

But then again, it might just as well have been ISO-8859-1 encoding, in which case they are simply the characters as displayed in your post...
superdezign
DevNet Master
Posts: 4135
Joined: Sat Jan 20, 2007 11:06 pm

Re: Finding characters

Post by superdezign »

Try changing the encoding on the page that they are displaying on, in order to see if they show up differently.
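
When the original page is not available, the same idea can be tried offline: re-encode the garbled text with a candidate single-byte encoding and decode the resulting bytes as UTF-8. This is only a sketch; `reinterpret` is a hypothetical helper, and the list of candidate codecs is an assumption.

```python
def reinterpret(text, candidates=("cp1252", "latin-1")):
    """Try to undo a wrong decoding step for each candidate codec.

    Returns a dict mapping codec name to the repaired string,
    or None when that codec cannot explain the text.
    """
    results = {}
    for codec in candidates:
        try:
            # Recover the raw bytes, then read them as UTF-8.
            results[codec] = text.encode(codec).decode("utf-8")
        except (UnicodeEncodeError, UnicodeDecodeError):
            results[codec] = None
    return results


# "a-circumflex, euro sign, broken bar", written with escapes
# so the example itself cannot be mangled.
garbled = "\u00e2\u20ac\u00a6"
print(reinterpret(garbled))  # {'cp1252': '…', 'latin-1': None}
```

If the first sample originally ended in a curly closing quote (U+201D) rather than a straight one, which is an assumption, `reinterpret("\u00e2\u20ac\u201d")["cp1252"]` yields U+2014, an em dash.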
matthijs
DevNet Master
Posts: 3360
Joined: Thu Oct 06, 2005 3:57 pm

Re: Finding characters

Post by matthijs »

@superdezign: I don't have the original page, that's the problem. So I need to reverse-engineer/guess.

@Apollo: it could be the three dots. I will check it out. Is there some character map or online tool to help translate different characters? For example, how do you find out those byte sequences?
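
One way to look up such byte sequences without an online tool is Python's standard library. The sketch below lists the code point, Unicode name, and UTF-8 bytes of each character; `describe` is a made-up helper name.

```python
import unicodedata


def describe(text):
    """Return (code point, Unicode name, UTF-8 bytes in hex) per character."""
    rows = []
    for ch in text:
        utf8_hex = " ".join(f"0x{b:02X}" for b in ch.encode("utf-8"))
        name = unicodedata.name(ch, "<unnamed>")
        rows.append((f"U+{ord(ch):04X}", name, utf8_hex))
    return rows


# The ellipsis followed by a normal dot, as in the third sample line.
for row in describe("\u2026."):
    print(row)
# ('U+2026', 'HORIZONTAL ELLIPSIS', '0xE2 0x80 0xA6')
# ('U+002E', 'FULL STOP', '0x2E')
```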