ProgrammingThis forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
1. Are there any known multibyte encodings (any encoding, not just those used in Linux/Unix) which use characters in the portable C character set as
1) the first byte of a shift byte sequence (for encodings with shift states) OR
2) the first byte of a multibyte char (for encodings without shift states)?
2. ISO 9899:1999 (Standard C specs) says,
"While in the initial shift state, all single-byte characters retain their usual interpretation and do not alter the shiftstate."
Does this mean that the portable character set (single byte characters) must have the same representation (bit pattern) while in the initial shift state of a arbitry locale as they did while in the default "C" locale?
3. Are there restrictions to the kind of encodings that can be used by locales in Linux? eg Does a multibyte encoding which has ascii '/' as part of a multibyte sequence allowed to be used in Linux locales?
ps: to moderators/ site owner -can we have another forum dedicated to Linux i18n?