Ruby 1.9:我怎样才能正确地大写 &小写多字节字符串? [英] Ruby 1.9: how can I properly upcase & downcase multibyte strings?

查看:20
本文介绍了Ruby 1.9:我怎样才能正确地大写 &小写多字节字符串?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

因此 matz 决定在 ruby​​ 1.9.1 中将 upcasedowncase 限制为 /[AZ]/i.p>

ActiveSupport::Multibyte 长期以来在 ruby​​ 1.8.x 中通过 String#mb_chars 实现了出色的 i18n 大小写.

但是,在 ruby​​ 1.9.1 下尝试时,它似乎不起作用.这是我编写的一个简单的测试脚本,以及我得到的输出:

$ cat test.rb# 编码:UTF-8puts("@ #{RUBY_VERSION} " + (__ENCODING__ 拯救 $KCODE).to_s)sd, su = "Iñtërnâtiônàlizætiøn", "IÑTËRNÂTIÔNÀLIZÆTIØN"定义 ps(u, d, k);将 "%-30s: %24s/%-24s" % [k, u, d] 结束ps sd.upcase, su.downcase, "纯红宝石"需要红宝石";需要'active_support'ps sd.upcase, su.downcase, "使用 active_support"ps sd.mb_chars.upcase.to_s, su.mb_chars.downcase.to_s, "使用 active_support mb_chars"$红宝石-KU test.rb@ 1.8.7 UTF8纯红宝石 : IñTëRNâTIôNàLIZæTIøN/iÑtËrnÂtiÔnÀlizÆtiØn使用 active_support : IñTëRNâTIôNàLIZæTIøN/iÑtËrnÂtiÔnÀlizÆtiØn使用 active_support mb_chars:IÑTËRNÂTIÔNÀLIZÆTIØN/iñtërnâtiônàlizætiøn$ ruby​​1.9 test.rb@ 1.9.1 UTF-8纯红宝石 : IñTëRNâTIôNàLIZæTIøN/iÑtËrnÂtiÔnÀlizÆtiØn使用 active_support : IñTëRNâTIôNàLIZæTIøN/iÑtËrnÂtiÔnÀlizÆtiØn使用 active_support mb_chars : IñTëRNâTIôNàLIZæTIøN/iÑtËrnÂtiÔnÀlizÆtiØn

那么,如何使用 ruby​​ 1.9.1 获得国际化的 upcasedowncase?

更新

我应该补充一点,我还使用当前 master2-3-*3-0-unstable 轨道的 ActiveSupport 进行了测试GitHub 上的分支.结果相同.

解决方案

大小写转换依赖于语言环境,并不总是往返,这就是为什么 Ruby 1.9 没有涵盖它(参见 这里这里)

unicode-util gem 应该可以满足您的需求.

So matz made the decision to keep upcase and downcase limited to /[A-Z]/i in ruby 1.9.1.

ActiveSupport::Multibyte has long had great i18n case jiggering in ruby 1.8.x via String#mb_chars.

However, when tried under ruby 1.9.1, it doesn't seem to work. Here's a simple test script I wrote, along with the output I'm getting:

$ cat test.rb
# encoding: UTF-8

puts("@ #{RUBY_VERSION} " + (__ENCODING__ rescue $KCODE).to_s)
sd, su = "Iñtërnâtiônàlizætiøn", "IÑTËRNÂTIÔNÀLIZÆTIØN"
def ps(u, d, k); puts "%-30s:  %24s / %-24s" % [k, u, d] end
ps sd.upcase, su.downcase, "Plain ruby"

require 'rubygems'; require 'active_support'
ps sd.upcase, su.downcase, "With active_support"
ps sd.mb_chars.upcase.to_s, su.mb_chars.downcase.to_s, "With active_support mb_chars"

$ ruby -KU test.rb
@ 1.8.7 UTF8
Plain ruby                    :  IñTëRNâTIôNàLIZæTIøN / iÑtËrnÂtiÔnÀlizÆtiØn
With active_support           :  IñTëRNâTIôNàLIZæTIøN / iÑtËrnÂtiÔnÀlizÆtiØn
With active_support mb_chars  :  IÑTËRNÂTIÔNÀLIZÆTIØN / iñtërnâtiônàlizætiøn

$ ruby1.9 test.rb
@ 1.9.1 UTF-8
Plain ruby                    :      IñTëRNâTIôNàLIZæTIøN / iÑtËrnÂtiÔnÀlizÆtiØn
With active_support           :      IñTëRNâTIôNàLIZæTIøN / iÑtËrnÂtiÔnÀlizÆtiØn
With active_support mb_chars  :      IñTëRNâTIôNàLIZæTIøN / iÑtËrnÂtiÔnÀlizÆtiØn

So, how do I get internationalized upcase and downcase with ruby 1.9.1?

update

I should add that I also tested with ActiveSupport from the current master, 2-3-* and 3-0-unstable rails branches at GitHub. Same results.

解决方案

Case conversion is locale dependent and doesn't always round-trip, which is why Ruby 1.9 doesn't cover it (see here and here)

The unicode-util gem should address your needs.

这篇关于Ruby 1.9:我怎样才能正确地大写 &小写多字节字符串?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆