|
In the Linux/Unix world UTF-8 ist used and usally not UTF-16. (Just to mention that..)
And yes, you can find out the encoding of some character:
All ASCII (and iso-8859-1 btw) characters are in Unicode/UTF* like their counterparts in ASCII but just with leading 00 - if A is in ASCII 41 (in hex), it's translated into 0041. (More bits..)
Usally, you've got some encoding/recoding functions in most programming languages for this stuff.
See also the excellent Wikipedia articles about Unicode, UTF-16 and UTF-8.
|