drop ambiguous-wide behaviour from Unicode CJK locales

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

drop ambiguous-wide behaviour from Unicode CJK locales

Thomas Wolff
It seems that ambiguous-wide behaviour (i.e. double-width property for
characters in the East Asian Ambiguous width category) for CJK locales
with UTF-8 encoding is inconsistent with Linux locale definitions.
The attached patch changes that. Characters like ─ ü æ are no longer
wide in the following locales:
ja_JP.utf8
ko_KR.utf8
zh_*.utf8
but only in ja, ko, zh locales with legacy encoding. Explicit modifiers
@cjkwide and @cjknarrow are not affected.
Thomas

0001-drop-ambiguous-wide-behaviour-from-Unicode-CJK-local.patch (3K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: drop ambiguous-wide behaviour from Unicode CJK locales

Corinna Vinschen-2
On Oct  7 18:55, Thomas Wolff wrote:

> It seems that ambiguous-wide behaviour (i.e. double-width property for
> characters in the East Asian Ambiguous width category) for CJK locales with
> UTF-8 encoding is inconsistent with Linux locale definitions.
> The attached patch changes that. Characters like ─ ü æ are no longer wide in
> the following locales:
> ja_JP.utf8
> ko_KR.utf8
> zh_*.utf8
> but only in ja, ko, zh locales with legacy encoding. Explicit modifiers
> @cjkwide and @cjknarrow are not affected.
> Thomas

Pushed (but this should go to the newlib list in the first place).


Thanks,
Corinna