Thai line breaking: how to break Thai text effectively

Views: 1337 Published: 2017/2/6 11:30:46 Tags: html css unicode internationalization browser-bugs


Problem description

Situation
On a client site that uses Thai text, we can't control where exactly particular words or sentences break between lines (that is, how the web browser handles it). Local reviewers often flag the resulting appearance of the content as incorrect.

Workaround
The workaround is for the copywriter to deliver Thai content with breaking (&#8203;) and non-breaking (&#65279;) zero-width space characters included.

In practice, rather than:

ของเพื่อนๆ ที่ออนไลน์อยู่

we should use something as ugly as:

ข&#65279;อ&#65279;ง&#65279;เ&#65279;พื่&#65279;อ&#65279;น&#65279;ๆ&#8203;ที่&#8203;อ&#65279;อ&#65279;น&#65279;ไ&#65279;ล&#65279;น์&#65279;อ&#65279;ยู่

The above is just an example; I don't really know where exactly the breakpoints are allowed.
In fact, non-breaking zero-width spaces alone would also do the trick ... it's just stricter and more correct to use breaking ones as well, for better accuracy.
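
As a rough illustration of this workaround, markup of that shape could be produced by a script instead of being typed by hand. The Python sketch below assumes someone has already split the copy into units that must never break internally. The names `clusters`, `protect` and `units` are made up for this example, and it only keeps combining marks of Unicode category Mn attached to their base character, which may not cover every Thai rendering case.

```python
import unicodedata

NO_BREAK = "&#65279;"  # zero-width no-break space (U+FEFF), as used in the question
BREAK = "&#8203;"      # zero-width space (U+200B), a permitted break point


def clusters(text):
    """Split text into base characters with their combining marks attached."""
    result = []
    for ch in text:
        # Thai vowel signs above/below and tone marks are category Mn;
        # they stay glued to the preceding character.
        if result and unicodedata.category(ch) == "Mn":
            result[-1] += ch
        else:
            result.append(ch)
    return result


def protect(units):
    """Forbid breaks inside each unit, allow them only between units."""
    return BREAK.join(NO_BREAK.join(clusters(unit)) for unit in units)


# Hypothetical hand-segmented input: breaks are allowed only between units.
units = ["ของเพื่อนๆ", "ที่", "ออนไลน์อยู่"]
print(protect(units))
```

The segmentation itself still has to come from a person or a tool that knows Thai; the script only removes the mechanical typing.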

And while it is definitely doable like this, it is also a time-consuming and not very effective solution for managing the content of a large site. Simply put, the effort put into it doesn't match the effect needed.

Research
Research so far has led only to the workaround mentioned, and I'm still looking for a better way to handle this. Even the W3C doesn't have a solution yet and is just discussing whether it should be part of the CSS3 specification.

Thai uses spaces very rarely, mostly to separate sentences and the like. A Thai sentence therefore usually appears as one long string. Where to break such a string when the text runs over several lines is determined by identifying the individual words. For word identification, local dictionaries are used, which are most probably part of the operating system or the web browser; I'm not entirely sure about this.

Apparently, the more web browsers and operating systems you check, the more different results you get! Moreover, there's not much you can do about this, as it's system driven and there are no "where to break Thai" settings available.
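
To make the dictionary dependence concrete, here is a toy sketch in Python of greedy longest-match word segmentation. It is not how any particular browser or operating system does it; the `segment` function and the two tiny dictionaries are assumptions made up for illustration. It only shows why the same string can yield different break opportunities when the dictionaries differ.

```python
def segment(text, dictionary):
    """Greedy longest-match segmentation.

    Characters that start no known word become single-character
    segments (a toy fallback that may even split combining marks).
    """
    longest = max([1] + [len(word) for word in dictionary])
    segments = []
    i = 0
    while i < len(text):
        # Try the longest possible candidate first, then shrink.
        for size in range(min(longest, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                segments.append(candidate)
                i += size
                break
    return segments


# Hypothetical word list for the example sentence used in the question.
words = ["ของ", "เพื่อน", "ๆ", "ที่", "ออนไลน์", "อยู่"]
text = "".join(words)

full_dictionary = set(words)
smaller_dictionary = full_dictionary - {"ออนไลน์"}  # one word missing

print(segment(text, full_dictionary))     # breaks fall between the six words
print(segment(text, smaller_dictionary))  # the unknown word falls apart
```

With the smaller dictionary the unknown word is cut into pieces, so a break could land in the middle of it, which is the kind of result reviewers would flag.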

Using <wbr/>, &#8203; or &shy; to indicate where the breakpoints really are won't stop the web browser from thinking (even if wrongly) that breaks are also possible in places where you haven't defined them, e.g. in the middle of a word, which might be grammatically incorrect.

If such a word ends up at the end of a line (which depends on screen resolution, copy length and the CSS rules defined) and the browser applies its wrong line-breaking rule to it, then you end up with a Thai line-breaking issue, no matter whether you have defined other breakpoints before, after or somewhere else in the word. The browser will always use the breakpoint that it thinks is closest to the end of the line, not just the ones you have gently suggested by inserting one of the mentioned characters in your markup.

That's why you actually need to focus on where not to break your text (non-breaking zero-width space), not on where breaking is allowed. And that's what leads us back to the ugly and long markup example in the "Workaround" section above. That way a line break can occur strictly only where you have allowed it, but it's messy.

Any other solution for handling this more effectively would be appreciated ... and who knows, it might even help the W3C with their implementation?

Thanks!
