Java regex vs. Unicode TR#18 vs. ICU

Xueming Shen xueming.shen at
Wed Mar 6 21:06:14 UTC 2013

On 03/06/2013 12:44 PM, Steven R. Loomis wrote:
> Hello,
>  Someone on the ICU team recently compared the use of "\w" between ICU, Java, and Unicode TR#18.
> The results are recorded in an ICU bug.
> A question for core-libs-dev is, does Java plan to change the semantics of \w to match TR#18's list?

It appears the "standard" added one more entry, \p{Join_Control}, during
its last update :-( I may consider updating the spec/impl to match that. I
would assume that no jdk7 application really depends on the updated \w
(in jdk7).
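
For anyone who wants to check what their own JDK does, here is a minimal sketch. It is not part of the original exchange, and the class and method names are made up for illustration. It compares the default ASCII-only \w with the Unicode-aware \w enabled by Pattern.UNICODE_CHARACTER_CLASS, which was added in jdk7. Join_Control covers exactly U+200C (ZERO WIDTH NON-JOINER) and U+200D (ZERO WIDTH JOINER). Whether the Unicode-aware \w accepts them depends on the JDK release, so no expected output is shown for those two.

```java
import java.util.regex.Pattern;

// Hypothetical helper class, for illustration only.
class WordClassDemo {
    // Default \w is ASCII-only: [a-zA-Z_0-9]
    private static final Pattern ASCII_WORD = Pattern.compile("\\w");

    // Unicode-aware \w, switched on by the flag introduced in jdk7
    private static final Pattern UNICODE_WORD =
            Pattern.compile("\\w", Pattern.UNICODE_CHARACTER_CLASS);

    // Returns true if the whole string s is a single \w match.
    static boolean matchesWord(String s, boolean unicode) {
        Pattern p = unicode ? UNICODE_WORD : ASCII_WORD;
        return p.matcher(s).matches();
    }

    public static void main(String[] args) {
        // 'a', e-acute, ZWNJ and ZWJ (the two Join_Control code points)
        String[] samples = {"a", "\u00E9", "\u200C", "\u200D"};
        for (String s : samples) {
            System.out.printf("U+%04X ascii=%b unicode=%b%n",
                    s.codePointAt(0),
                    matchesWord(s, false),
                    matchesWord(s, true));
        }
    }
}
```

If the last two lines print unicode=false, the runtime's \w does not yet include Join_Control.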

