Page 1 of 1
Language : ISO / UTF ?
Posted: Mon Feb 14, 2005 5:44 am
by anjanesh
Im trying to get a site of mine display in indian languages.
Im new to this utf / iso language concept.
I understand for hindi its utf-8 but whats the font specified ? By sending Content utf-8 the font will be automatically detected or do we have to specify one ?
BTW, I checked up W3C and unicode - couldn't find anything for Tamil and Malayalam characters - are there any specs on these ? Or just simple font change in HTML tags ?
Thanks
Posted: Mon Feb 14, 2005 7:48 am
by n00b Saibot
see utf has definition for characters of many langauges.
if simply start the charmap program in windows you can see them in the font Arial Unicode MS in the Devanagri section.

Posted: Mon Feb 14, 2005 8:44 am
by feyd
you have to specify that the page is of a certain charset if you want proper submissions and reading. In your case, you need to specify the content as being utf8 encoded.
Posted: Fri Feb 18, 2005 2:27 am
by anjanesh
Ok. So font doesnt have to be specified.
In this example why is Test being shown as Test in English ? Shouldnt it be shown with the Hindi characters ?
Code: Select all
<!DOCTYPE HTML PUBLIC "-//W3C//DTD XHTML 1.0 Frameset//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-frameset.dtd">
<HTML XMLNS="http://www.w3.org/1999/xhtml" XML:LANG="hi" LANG="hi" DIR="ltr">
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;charset=hindi-utf-8" />
<TITLE>Hindi</TITLE>
</HEAD>
<BODY>Test</BODY>
</HTML>
Posted: Fri Feb 18, 2005 2:57 am
by Maugrim_The_Reaper
Not completely certain - but the encoding specifies the character set you used. So assuming you wrote using utf-8 compatible characters, the browser should show them correctly.
However Test are not hindi characters, are they? So why would your encoding alter how they are viewed? Western/Roman characters are part of utf-8 too...so they'll be displayed as is. You need to literally write in hindi characters with a unicode font setup on your editor I assume...
Posted: Fri Feb 18, 2005 2:59 am
by Maugrim_The_Reaper
Posted: Fri Feb 18, 2005 3:19 am
by anjanesh
Im using a plain text-editor.
Im trying to understand this using the the script in phpMyAdmin 2.6.0 pl3 in the file : lang/hindi-utf-8.inc.php.
I opened it in the editor and saw the chars. Take for example on Line 30:
$strAction = ' कारà¥
Posted: Fri Feb 18, 2005 6:16 am
by n00b Saibot
anjanesh wrote:But I dont expect to type the entire sentence using the dec/hex codes.
That's the way it is baybee.
if you wanna type normally from keyboard you gotta use some of the readily available hindi fonts.

Posted: Sat Feb 19, 2005 1:34 am
by anjanesh
But according to this :
http://www.alanwood.net/unicode/devanagari.html
A stands for अ
AA stands for आ
So is there anyway to type AA in a plain text editor and get output आ in the browser ?
Posted: Sat Feb 19, 2005 7:46 am
by feyd
if the editor supports unicode, you can do the alt-keypad typing, as is, a unicode editor with typing AA will produce four bytes instead of two, typically.
Posted: Sat Feb 19, 2005 9:06 am
by anjanesh
Im not looking for to type आ in a text editor. Something like this (say in notepad or Context):
Code: Select all
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;charset=utf-8" />... and all that
<?
$strLetter="AA";
echo $strLetter;
?>
The output in any browser (IE / FF) should be आ
Posted: Mon Feb 21, 2005 3:41 am
by n00b Saibot
Hey, where does it say that A
'stands for' अ , huh?
What it only says is this that A
'is Devanagri Name for' अ. OKay

Not the character for typing अ .
