如何使用sed提取定界符之间的字符? [英] How to extract characters between the delimiters using sed?

查看：67 发布时间：2021/5/29 22:13:20 linux scripting sed

本文介绍了如何使用sed提取定界符之间的字符?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我刚刚开始学习sed.我想提取并打印 > 和 < 之间的字符.定界符.这是我的数据文件中的文本:

I have just started learning sed. I want to extract and print the characters between the > and < delimiters. Here the text in my data file:

<span id="ctl00_ContentPlaceHolder1_lblRollNo">12029</span>

   <br /><b>Engineering & IT/Computer Science</b><br />

        <div id="ctl00_ContentPlaceHolder1_divEngITMerit">

                        <span id="ctl00_ContentPlaceHolder1_lblEngITSelListNo">3rd Provisional Selection List</span>

                <tr><td style='width: 200px' class='TblTRData'>IT/Computer Science/Software</td><td style='width: 150px'class='TblTRData'>7 (out of 471)</td><td style='width: 325px'class='TblTRData'>Selected in MS COMPUTER SCIENCE</td></tr>

                                Name:

                                <span id="ctl00_ContentPlaceHolder1_lblName">SIDRA SHAHID</span>

                                Father Name:

                                <span id="ctl00_ContentPlaceHolder1_lblFatherName">SHAHID RAFEEQ AHMAD</span>

我已经写了命令:

sed -n -e '/^[^>]*>\([^<]*\)<.*/s//\1/p' myfile.txt

问题在于它正在返回某些<<之间的文本.例如，它会打印 12029 ，但不会在在MS COMPUTER SCIENCE中选择中被选中.我在做什么错了?

The problem is that it is returning the text between some of the > <. For example, it prints 12029, but not Selected in Selected in MS COMPUTER SCIENCE. What am I doing wrong?

推荐答案

如果您只需要提取标记之间的字符串，则意味着您需要删除标记，使它们之间的字符串保持不变.对吧?

If you need to extract only strings between tags, this means you need to delete tags leaving strings between them untouched. Right?

sed 's/<[^>]*>//g'

它将(所有出现的)标记(<"下一个>"之后的所有内容)替换为空字符串(不包含任何内容).文本将保留.

It substitutes (all occurrences) of tag ( "<" everything upon next ">" ) with empty string (nothing). Text will remain.

这篇关于如何使用sed提取定界符之间的字符?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

如何使用sed提取定界符之间的字符? [英] How to extract characters between the delimiters using sed?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录关闭

如何使用sed提取定界符之间的字符? [英] How to extract characters between the delimiters using sed?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录 关闭

登录关闭