Cmd到Powershell替换-特殊字符 [英] Cmd to powershell replace - special character

查看:102
本文介绍了Cmd到Powershell替换-特殊字符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在创建一个脚本,该脚本将复制文件,重命名文件,然后向内看以删除某些特殊字符.这些特殊字符之一是某种我无法用键复制的ASCII单引号.我可以复制并粘贴它,但是替换功能不起作用.

I am creating a script that will copy a file, rename it and then look inside to remove certain special characters. One of these special characters is some sort of ASCII apostrophe that I cannot replicate with keys. I can copy and paste it though, however the replace function doesn't work.

打开文件>搜索奇怪的撇号,然后将其替换为任何内容.我希望将其替换为普通的撇号,但我不知道如何完成,目前最大的问题是我无法让它看到"自动生成的奇怪的撇号我正在修改的文件.任何帮助,不胜感激.谢谢:)

Opens file > Searches for strange apostrophe ’ and replaces with nothing. I'd like it to replace it with a normal apostrophe but I don't know how this is done, and at current the biggest problem is that I can't get it to "see" this strange apostrophe that winds up in the autogenerated file I'm modifying. Any help much appreciated. Thanks :)

文件中的撇号:

正常的撇号:'

这是我已隔离进行测试的批次的一部分.

This is a chunk of the batch that I've isolated to test with.

        @echo off

    set YYMMDD=%DATE:~-2,2%%DATE:~-7,2%%DATE:~-10,2%
    set DDMMYYYY=%DATE:~-10,2%%DATE:~-7,2%%DATE:~-4,4%
    set YYYY-MM-DD=%DATE:~-4,4%-%DATE:~-7,2%-%DATE:~-10,2%

powershell -Command "(gc 'C:\LOCATION\Client_List_%DDMMYYYY%.csv') -replace '’', '' | Out-File 'C:\LOCATION\Client_List_%DDMMYYYY%.csv'"

    Echo Done

推荐答案

set "fileIn=C:\LOCATION\Client_List_%DDMMYYYY%.csv"
set "fileOu=C:\LOCATION\Client_List_%DDMMYYYY%.csv"
powershell -c "(gc '%fileIn%').Replace('‘‘','').Replace('’’','')|Out-File '%fileOu%'"

奇怪的单引号U+2019 右单引号,据称是结束语.可以将其与其他开头引号配对.在上面的示例中,U+2018 左单引号.

That strange apostrophe is U+2019 Right Single Quotation Mark, supposedly a closing quote. It could be paired with a different opening quote. In above example, is U+2018 Left Single Quotation Mark.

Get-Help 'about_Quoting_Rules'

引号用于指定文字字符串.您可以附上 用单引号(')或双引号引起的字符串 (").

Quotation marks are used to specify a literal string. You can enclose a string in single quotation marks (') or double quotation marks (").

事实上,PowerShell接受两个不同的引号:

In fact, PowerShell accepts two different sets of quotes:

  • 双引号 "
  • 单引号 '
  • double quotation marks " " "
  • single quotation marks '

AFAIK,大多数Windows ANSI 代码页(1252、1250、1257、1253、1251、1254、1255、1256、1258)中都包含所有这些引号,因此可以在字面使用它们ANSI保存的.bat脚本-,除了后一个引号 U+201B 单个高反9引号 .在这种情况下,请使用$([char]0x201B)代替'‛‛',如下所示:

AFAIK, all those quotation marks are present in most Windows ANSI code pages (1252, 1250, 1257, 1253, 1251, 1254, 1255, 1256, 1258) so they may be used literally in ANSI-saved .bat script - except the latter quotation mark U+201B Single High-Reversed-9 Quotation Mark. In such case, use $([char]0x201B) instead of '‛‛' as follows:

rem        cast [char] to `[string]`    ↓↓↓↓↓↓↓↓
powershell -c "(gc '%fileIn%').Replace( [string]$([char]0x201B) , '')"
rem                                             ↑↑↑↑↑↑↑↑↑↑↑↑↑↑↑

或如下:

rem [char] can't be empty so specify `[string]`           ↓↓↓↓↓↓↓↓
powershell -c "(gc '%fileIn%').Replace( $([char]0x201B) , [string]'')"
rem                                     ↑↑↑↑↑↑↑↑↑↑↑↑↑↑↑

分析和解释

下一个PowerShell代码片段显示了Unicode数据库的摘录(字符名称以Quotation Mark结尾或包含Apostrophe):

Next PowerShell code snippet shows an excerpt from Unicode database (character names ending with Quotation Mark or containing Apostrophe):

PS D:> 0x22,0x27,0x00AB,0x00BB,0x2018,0x2019,0x201A,0x201B,0x201C,0x201D,0x201E,0x201F,
  0x2039,0x203A,0x2E42,0x301D,0x301E,0x301F,0x055A | Get-CharInfo | Format-Table -AutoSize

Char CodePoint                Category Description                               
---- ---------                -------- -----------                               
   " U+0022           OtherPunctuation Quotation Mark                            
   ' U+0027           OtherPunctuation Apostrophe                                
   « U+00AB    InitialQuotePunctuation Left-Pointing Double Angle Quotation Mark 
   » U+00BB      FinalQuotePunctuation Right-Pointing Double Angle Quotation Mark
   ‘ U+2018    InitialQuotePunctuation Left Single Quotation Mark                
   ’ U+2019      FinalQuotePunctuation Right Single Quotation Mark               
   ‚ U+201A            OpenPunctuation Single Low-9 Quotation Mark               
   ‛ U+201B    InitialQuotePunctuation Single High-Reversed-9 Quotation Mark     
   " U+201C    InitialQuotePunctuation Left Double Quotation Mark                
   " U+201D      FinalQuotePunctuation Right Double Quotation Mark               
   „ U+201E            OpenPunctuation Double Low-9 Quotation Mark               
   ‟ U+201F    InitialQuotePunctuation Double High-Reversed-9 Quotation Mark     
   ‹ U+2039    InitialQuotePunctuation Single Left-Pointing Angle Quotation Mark 
   › U+203A      FinalQuotePunctuation Single Right-Pointing Angle Quotation Mark
   ⹂ U+2E42           OtherNotAssigned Undefined                                 
   〝 U+301D            OpenPunctuation Reversed Double Prime Quotation Mark      
   〞 U+301E           ClosePunctuation Double Prime Quotation Mark               
   〟 U+301F           ClosePunctuation Low Double Prime Quotation Mark           
   ՚ U+055A           OtherPunctuation Armenian Apostrophe                       

(来自修改后的Get-CharInfo cmdlet的输出.)原始的Get-CharInfo模块可从 http://poshcode.org下载./5234 .

(Output from modified Get-CharInfo cmdlet.) Original Get-CharInfo module is downloadable from http://poshcode.org/5234.

下一个PowerShell脚本通过显示一些有效的引号组合(在我的语言环境中无效)来完成上述结果:

Next PowerShell script completes above results by showing some valid (and invalid in my locale) combinations of quotes:

$arrSingleQuotes = 
 ''' U+0027 Apostrophe '''                                ,
 ‘‘‘ U+2018 Left Single Quotation Mark ‘‘‘                ,
 ’’’ U+2019 Right Single Quotation Mark ’’’               ,
 ‚‚‚ U+201A Single Low-9 Quotation Mark ‚‚‚               ,
 ‛‛‛ U+201B Single High-Reversed-9 Quotation Mark ‛‛‛     ,
 ‘‘‘ U+2018 (Left/Right) Single Quotation Mark U+2019 ’’’ ,
 ’’’ U+2019 (Right/Left) Single Quotation Mark U+2018 ‘‘‘
'$arrSingleQuotes (any combination)'
 $arrSingleQuotes

$arrDoubleQoutes = 
 """ U+0022 Quotation Mark """                            ,
 """ U+201C Left Double Quotation Mark """                ,
 """ U+201D Right Double Quotation Mark """               ,
 „„„ U+201E Double Low-9 Quotation Mark „„„               ,
 """ U+201C (Left/Right) Double Quotation Mark U+201D """ ,
 """ U+201D (Right/Left) Double Quotation Mark U+201C """
'$arrDoubleQoutes (any combination)'
 $arrDoubleQoutes

$noQuotes = @"
 « U+00AB Left-Pointing Double Angle Quotation Mark
 » U+00BB Right-Pointing Double Angle Quotation Mark
 ‟ U+201F Double High-Reversed-9 Quotation Mark
 ⹂ U+2E42 DOUBLE LOW-REVERSED-9 QUOTATION MARK
 ‹ U+2039 Single Left-Pointing Angle Quotation Mark
 › U+203A Single Right-Pointing Angle Quotation Mark
〝 U+301D Reversed Double Prime Quotation Mark
 〞U+301E Double Prime Quotation Mark
 〟U+301F Low Double Prime Quotation Mark
 ՚ U+055A Armenian Apostrophe                       
"@
'$noQuotes'
 $noQuotes

输出:

PS D:> D:\PShell\SO\41488245_quotes.ps1

$arrSingleQuotes (any combination)
' U+0027 Apostrophe '
‘ U+2018 Left Single Quotation Mark ‘
’ U+2019 Right Single Quotation Mark ’
‚ U+201A Single Low-9 Quotation Mark ‚
‛ U+201B Single High-Reversed-9 Quotation Mark ‛
‘ U+2018 (Left/Right) Single Quotation Mark U+2019 ’
’ U+2019 (Right/Left) Single Quotation Mark U+2018 ‘

$arrDoubleQoutes (any combination)
" U+0022 Quotation Mark "
" U+201C Left Double Quotation Mark "
" U+201D Right Double Quotation Mark "
„ U+201E Double Low-9 Quotation Mark „
" U+201C (Left/Right) Double Quotation Mark U+201D "
" U+201D (Right/Left) Double Quotation Mark U+201C "

$noQuotes
 « U+00AB Left-Pointing Double Angle Quotation Mark
 » U+00BB Right-Pointing Double Angle Quotation Mark
 ‟ U+201F Double High-Reversed-9 Quotation Mark
 ⹂ U+2E42 DOUBLE LOW-REVERSED-9 QUOTATION MARK
 ‹ U+2039 Single Left-Pointing Angle Quotation Mark
 › U+203A Single Right-Pointing Angle Quotation Mark
〝 U+301D Reversed Double Prime Quotation Mark
 〞U+301E Double Prime Quotation Mark
 〟U+301F Low Double Prime Quotation Mark
 ՚ U+055A Armenian Apostrophe                       

请注意,⹂ U+2E42 DOUBLE LOW-REVERSED-9 QUOTATION MARK存在于Unicode数据库中,并已在PowerShell ISE中正确呈现.

Note that ⹂ U+2E42 DOUBLE LOW-REVERSED-9 QUOTATION MARK is present in Unicode database and is properly rendered in PowerShell ISE.

附录:我发现了更多引号的候选对象(仅显示了从Excerpt_From_UnicodeDataTxt.ps1脚本获得的结果):

Addendum: I found more candidates of quotation marks (shown merely result obtained from Excerpt_From_UnicodeDataTxt.ps1 script):

PS > $x = .\tests\Excerpt_From_UnicodeDataTxt.ps1 -SearchString "Quotation|Apostrophe" | 
    Where-Object {$_.Category -match 'Punctuation'}

PS > $x.Count
23

PS > $x

Char CodePoint Category                   Description                                       
---- --------- --------                   -----------                                       
   " U+0022    Po-OtherPunctuation        Quotation Mark                                    
   ' U+0027    Po-OtherPunctuation        Apostrophe                                        
   « U+00AB    Pi-InitialQuotePunctuation Left-Pointing Double Angle Quotation Mark         
   » U+00BB    Pf-FinalQuotePunctuation   Right-Pointing Double Angle Quotation Mark        
   ՚ U+055A    Po-OtherPunctuation        Armenian Apostrophe                               
   ‘ U+2018    Pi-InitialQuotePunctuation Left Single Quotation Mark                        
   ’ U+2019    Pf-FinalQuotePunctuation   Right Single Quotation Mark                       
   ‚ U+201A    Ps-OpenPunctuation         Single Low-9 Quotation Mark                       
   ‛ U+201B    Pi-InitialQuotePunctuation Single High-Reversed-9 Quotation Mark             
   " U+201C    Pi-InitialQuotePunctuation Left Double Quotation Mark                        
   " U+201D    Pf-FinalQuotePunctuation   Right Double Quotation Mark                       
   „ U+201E    Ps-OpenPunctuation         Double Low-9 Quotation Mark                       
   ‟ U+201F    Pi-InitialQuotePunctuation Double High-Reversed-9 Quotation Mark             
   ‹ U+2039    Pi-InitialQuotePunctuation Single Left-Pointing Angle Quotation Mark         
   › U+203A    Pf-FinalQuotePunctuation   Single Right-Pointing Angle Quotation Mark        
   ❮ U+276E    Ps-OpenPunctuation         Heavy Left-Pointing Angle Quotation Mark Ornament 
   ❯ U+276F    Pe-ClosePunctuation        Heavy Right-Pointing Angle Quotation Mark Ornament
   ⹂ U+2E42    Ps-OpenPunctuation         Undefined                                         
   〝 U+301D    Ps-OpenPunctuation         Reversed Double Prime Quotation Mark              
   〞 U+301E    Pe-ClosePunctuation        Double Prime Quotation Mark                       
   〟 U+301F    Pe-ClosePunctuation        Low Double Prime Quotation Mark                   
   " U+FF02    Po-OtherPunctuation        Fullwidth Quotation Mark                          
   ' U+FF07    Po-OtherPunctuation        Fullwidth Apostrophe                              

这篇关于Cmd到Powershell替换-特殊字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆