且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

将CJK音译为拉丁语-***使用C ++

更新时间:2023-02-26 12:05:18

ICU: http://userguide.icu-project.org/transforms/general ,现在ICU 50具有CJK单词分段功能. uconv样本可与uconv -f utf-8 -t utf-8 -x 'Any-Latin'之类的东西一起使用以进行Any-Latin变换.不过,这并未考虑语言.

ICU: there are examples in http://userguide.icu-project.org/transforms/general and ICU 50 now has CJK word segmentation. The uconv sample can be used with something like uconv -f utf-8 -t utf-8 -x 'Any-Latin' to go through Any-Latin transform. That doesn't take language into account, though.