将语言字符转换为拉丁字母 [英] Convert language characters to Latin alphabet

查看:56
本文介绍了将语言字符转换为拉丁字母的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我尝试编写一个应用程序来学习外国字符.例如,如果你想学习日语,那么你就必须记住所有的平假名、片假名和汉字.(例如:あ、い、か... = 平假名 カ、サ、ケ... = 片假名本、学... = 汉字)

I try to program an application to learn foreign characters. If you for example want to learn Japanese, then you'd have to memorize all the Hiragana, Katakana and Kanji letters. (e.g あ、い、か... = Hiragana カ、サ、ケ... = Katakana 本、学... = Kanji).

示例:一些用户正在尝试学习日语.所以他必须学习:か = ka
本 = hon,意思是:基础/书/这个

Example: Some user is trying to learn Japanese. So he has to learn: か = ka
本 = hon, meaning: basis/ book/ this

而且他还要学习发音.

我的第一个问题是是否有任何图书馆或其他东西可以在 .net 中轻松完成?我还查看了 Microsoft IME,但我真的不知道如何在我的项目中使用它.

My first question is if there is any library or something to easily do it in .net? I also looked at Microsoft IME, but I couldn't really find out how I could use it in my project.

我还查看了 Unicode 数据库,它基本上是可能的.我还设法编写了一个将 か 转换为 ka 的项目.唯一缺少的是含义(例如本=basis/book/this),Unicode 数据库也提供了这些含义.不幸的是,我无法在我从中获取 UCD 数据的 .XML 文件中找到它们.当我在 UCD-网站.

I also looked at the Unicode database and it's basically possible to it. I also managed to write a Project to convert か to ka. The only Thing that is missing are the meanings (for example 本=basis/ book/ this), which are also provided by the Unicode database. I unfortunately couldn't find them in my .XML file from which I get the UCD data. It works, when I enter the word on the UCD-Website.

另一种方法是使用 CLDR-Library,它似乎也与 UCD 相关.不幸的是,我无法弄清楚我应该使用这两个(UCD 或 CLDR)中的哪一个.CLDR:http://cldr.unicode.org/

Another approach was to use the CLDR-Library which also seems to be related to UCD. I unfortunately couldn't figure out which of the two (UCD or CLDR) I should use. CLDR: http://cldr.unicode.org/

我的问题是 UCD 是否是最好的方法,我是否也可以使用 CLDR.

My question is if UCD is the best way to do it and if I could also use CLDR.

我真的不想使用我自己输入所有字符的普通列表.这会花费太多时间,尤其是所有汉字(超过 10,000 个).

I don't really want to work with normal lists where I just type in all the characters myself. It would take too much time, especially for all the Kanji letters (more than 10,000).

谢谢

我解决了它,我从 Unicode 字符数据库 (UCD) 中提取了信息.您可以在 .XML 文件中下载整个数据库.我只需要学习如何处理它并找到正确的属性.

I solved it, I extract the information from the Unicode Character Database (UCD). You can download the whole database in a .XML file. I just needed to learn how to handle it and find the correct attributes.

推荐答案

Google 和 Microsoft 都提供 API,您可以调用它来翻译文本.例如 http://www.microsoft.com/en-us/translator/translatorapi.aspx

Both Google and Microsoft offer APIs you can call to translate text. eg http://www.microsoft.com/en-us/translator/translatorapi.aspx

根据您选择的服务类型,可能需要支付少量费用.他们还为翻译提供声音.无需重新发明这个轮子.:-)

Depending the type of service you choose a small fee might be required. They also offer sounds for the translation. No need to re-invent this wheel. :-)

如果这是一个 CodePage 类型的问题.这个博客是一个有趣的起点http://www.joelonsoftware.com/articles/Unicode.html

If this was a CodePage type question. this blog is an amusing place to start http://www.joelonsoftware.com/articles/Unicode.html

回应有关选项的评论.Google 可以提供多种可能的翻译

in response to comment about options. Google can supply several possible translations

例如对于本

这篇关于将语言字符转换为拉丁字母的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆