使用ASP.Net C＃使用iTextSharp在PDF文件中查找字符串和位置 [英] Find a string and location in PDF file using iTextSharp using ASP.Net C#

查看：454 发布时间：2018/11/16 17:18:10 c# asp.net pdf itext

本文介绍了使用ASP.Net C＃使用iTextSharp在PDF文件中查找字符串和位置的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试使用Asp.net C＃中的iTextSharp查找字符串及其在PDF中的位置进行编辑。但到目前为止，借助Google提供的帮助，我无法做到。这是当前的代码，但它确实通过块读取文本块，但找不到所需的文本。需要帮助谢谢

I am trying to find a string and it's location in a PDF using iTextSharp in Asp.net C# for editing. But so far with the help available on Google I am unable to do it. This is the current code but it does read text chunk by chunk but couldn't find the required text. Need help Thanks

public class RectAndText
    {
        public iTextSharp.text.Rectangle Rect;
        public String Text;
        public RectAndText(iTextSharp.text.Rectangle rect, String text)
        {
            this.Rect = rect;
            this.Text = text;
        }
    }

    public class MyLocationTextExtractionStrategy : LocationTextExtractionStrategy
    {
        public List<RectAndText> myPoints = new List<RectAndText>();
        public String TextToSearchFor { get; set; }
        public System.Globalization.CompareOptions CompareOptions { get; set; }

        public MyLocationTextExtractionStrategy(String textToSearchFor, System.Globalization.CompareOptions compareOptions = System.Globalization.CompareOptions.None)
        {
            this.TextToSearchFor = textToSearchFor;
            this.CompareOptions = compareOptions;
        }
        public override void RenderText(TextRenderInfo renderInfo)
        {
            base.RenderText(renderInfo);
            var startPosition = System.Globalization.CultureInfo.CurrentCulture.CompareInfo.IndexOf(renderInfo.GetText(), this.TextToSearchFor, this.CompareOptions);
            if (startPosition < 0)
            {
                return;
            }
            var chars = renderInfo.GetCharacterRenderInfos().Skip(startPosition).Take(this.TextToSearchFor.Length).ToList();
            var firstChar = chars.First();
            var lastChar = chars.Last();

            var bottomLeft = firstChar.GetDescentLine().GetStartPoint();
            var topRight = lastChar.GetAscentLine().GetEndPoint();
            var rect = new iTextSharp.text.Rectangle(
                                                    bottomLeft[Vector.I1],
                                                    bottomLeft[Vector.I2],
                                                    topRight[Vector.I1],
                                                    topRight[Vector.I2]
                                                    );

            this.myPoints.Add(new RectAndText(rect, this.TextToSearchFor));
        }
    }

通话功能

string thisDir = System.Web.Hosting.HostingEnvironment.MapPath("~/");
var testFile = thisDir + "example.pdf";
var t = new MyLocationTextExtractionStrategy("searchstring"); //need to search this searchstring 

using (var r = new PdfReader(testFile))
{
   var ex = PdfTextExtractor.GetTextFromPage(r, 1, t);
}

foreach (var p in t.myPoints)
{
   Console.WriteLine(string.Format("Found text {0} at {1}x{2}", p.Text, p.Rect.Left, p.Rect.Bottom));
}

使用ASP.Net C＃使用iTextSharp在PDF文件中查找字符串和位置 [英] Find a string and location in PDF file using iTextSharp using ASP.Net C#

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录关闭

使用ASP.Net C＃使用iTextSharp在PDF文件中查找字符串和位置 [英] Find a string and location in PDF file using iTextSharp using ASP.Net C#

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录 关闭

登录关闭