Foreign characters and LDAP. What encoding/charset does LDAP expect?
Problem description
I am parsing XML with simplexml_load_string() and using the data within it to update Active Directory (AD) objects via LDAP.
Example XML (simplified):
<?xml version="1.0" encoding="UTF-8"?>
<users>
<user>Bìlbö Bággįnš</user>
<user>Gãńdåłf Thê Gręât</user>
<user>Śām Wīšë</user>
</users>
I first run an ldap_search() to find a single user and then proceed to change their attributes. Writing the above values straight into AD over LDAP results in some pretty mangled characters showing up.
For example: Bìlbö BággįnÅ¡
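A trailing "Å¡" where "š" was expected is the usual signature of UTF-8 bytes being decoded as a single-byte charset such as ISO-8859-1. The sketch below reproduces that effect in isolation. It assumes the mbstring extension is available, and the variable names are illustrative only.

```php
<?php
// "š" (U+0161) is encoded in UTF-8 as the two bytes C5 A1.
$utf8 = "\xC5\xA1";

// Treat each of those bytes as one ISO-8859-1 character and re-encode the
// result as UTF-8. A consumer that wrongly assumes Latin-1 does the same.
$misread = mb_convert_encoding($utf8, 'UTF-8', 'ISO-8859-1');

echo bin2hex($utf8), "\n";     // c5a1
echo bin2hex($misread), "\n";  // c385c2a1, the bytes for "Å" followed by "¡"
```

If the stored value has this shape, the bytes were most likely valid UTF-8 at some point and were reinterpreted or re-encoded along the way. utf8_encode() performs the same Latin-1 to UTF-8 conversion, so calling it on a string that is already UTF-8 produces this kind of double encoding.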
I have tried the following functions, to no avail:
utf8_encode($str);
utf8_decode($str);
iconv("UTF-8", "ISO-8859-1//TRANSLIT", $str);
iconv("UTF-8", "ASCII//TRANSLIT", $str);
iconv("UTF-8", "T.61", $str);
Ideally, I don't want to do any of these string conversions. UTF-8 should be fine, right?!
I've also noticed the following: I printed out the values to see how they come out. curl-ing the script from the CLI shows the correct characters, but web browsers show the same mangled characters as AD.
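One possible explanation for the curl-versus-browser difference, offered as an assumption rather than a diagnosis: a terminal set to UTF-8 renders the raw bytes correctly, while a browser that receives no charset declaration may fall back to a single-byte charset. A minimal sketch of declaring the charset explicitly follows; render_name() is a hypothetical helper, not part of the original script.

```php
<?php
// Hypothetical helper: HTML-escape a value, treating the input as UTF-8.
function render_name($name)
{
    return htmlspecialchars($name, ENT_QUOTES, 'UTF-8');
}

// Declare the charset before any output so the browser decodes the body as UTF-8.
header('Content-Type: text/html; charset=utf-8');

// "Bággįnš" written as explicit UTF-8 bytes so the source file encoding does not matter.
$name = "B\xC3\xA1gg\xC4\xAFn\xC5\xA1";
echo render_name($name), "\n";
```

If the browser output becomes correct with this header, the display problem is separate from whatever happens on the write path to AD.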
What's going on? Should I be looking at something else, e.g. URL encoding? I'm hoping this is down to a simple mistake on my end.
I entered these characters using the AD admin GUI to see how they would come out. I can read them via LDAP fine. The correct characters are displayed in a browser. curl-ing via the CLI shows question marks instead of foreign characters. Passing one of these returned values into mb_detect_encoding() returns UTF-8.
I then decided to modify the same object immediately, not by writing in a new string but by reversing the existing value and saving the object. This works fine: I see the correct value (reversed) in AD.
- Developing on: Mac OS X 10.7 Lion, PHP 5.4.3
- Running production on: Red Hat 6, PHP 5.4.3
- AD server: Windows 2003
UPDATE: After a few months, I was unable to find an answer/solution to this problem. In the end, I went with replacing the characters with their non-accented equivalents (NOT ideal, I know).
Recommended answer
Are you using LDAP v3?
ldap_set_option($ldap, LDAP_OPT_PROTOCOL_VERSION, 3);
LDAPv3 supports UTF-8 by default and expects requests and responses to be in that encoding. See here: http://technet.microsoft.com/en-us/library/cc961766.aspx
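To show where that option fits in a full update, here is a rough sketch that sets protocol version 3 before binding and writes an unconverted UTF-8 value. Everything specific in it is an assumption for illustration: the helper names require_utf8() and update_display_name(), the sAMAccountName filter, the displayName attribute, and all connection parameters. ldap_escape() needs PHP 5.6 or later, which is newer than the version mentioned in the question.

```php
<?php
// Hypothetical helper: refuse values that are not valid UTF-8 before sending them.
function require_utf8($value)
{
    if (!mb_check_encoding($value, 'UTF-8')) {
        throw new InvalidArgumentException('Value is not valid UTF-8');
    }
    return $value;
}

// Hypothetical function: host, credentials, base DN and attribute are placeholders.
function update_display_name($host, $bindDn, $password, $baseDn, $account, $newName)
{
    $ldap = ldap_connect($host);

    // Request LDAPv3 before binding so strings are exchanged as UTF-8.
    ldap_set_option($ldap, LDAP_OPT_PROTOCOL_VERSION, 3);

    if (!@ldap_bind($ldap, $bindDn, $password)) {
        throw new RuntimeException('Bind failed: ' . ldap_error($ldap));
    }

    // Escape the account name for use in a search filter (PHP 5.6+).
    $filter = '(sAMAccountName=' . ldap_escape($account, '', LDAP_ESCAPE_FILTER) . ')';
    $result = ldap_search($ldap, $baseDn, $filter, array('cn'));
    if ($result === false) {
        throw new RuntimeException('Search failed: ' . ldap_error($ldap));
    }

    $entries = ldap_get_entries($ldap, $result);
    if ($entries['count'] !== 1) {
        throw new RuntimeException('Expected exactly one matching user');
    }

    // Write the value as-is: no utf8_encode(), utf8_decode() or iconv().
    $ok = ldap_mod_replace($ldap, $entries[0]['dn'], array(
        'displayName' => array(require_utf8($newName)),
    ));

    ldap_unbind($ldap);
    return $ok;
}
```

The validation step is there so that a string damaged earlier in the pipeline fails loudly in PHP instead of being stored in the directory as mangled text.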