PDFBOX 2.0.18 - 如何遍历 PDF 页面并检索特定字段 [英] PDFBOX 2.0.18 - How to iterates through pages of a PDF and retrieve specific fields

查看：109 发布时间：2021/6/15 18:34:32 java pdf pdfbox

本文介绍了PDFBOX 2.0.18 - 如何遍历 PDF 页面并检索特定字段的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用 PDFBox 读取 pdf 文档上的特定字段.实际上，我可以使用仅包含一页的 pdf 获取我想要的所有信息.PDF 具有特定名称的字段，我可以获取所有字段并将其插入数据库.

I'm using PDFBox to read specific fields on a pdf document. Actually, I'm able to get all the informations I want with a pdf containing only one page. The PDF has fields with specific names and I can get all the fields and insert it in a database.

我将此代码与 AccroForm 一起使用以访问字段

I use this code with AccroForm to access the fields

InputStream document = item.getInputStream();
pdf = PDDocument.load(new RandomAccessBufferedFileInputStream(document));
pdCatalog = pdf.getDocumentCatalog();
pdAcroForm = pdCatalog.getAcroForm();

String dateRapport = pdAcroForm.getField("import_Date01").getValueAsString();
String radioReason = pdAcroForm.getField("NoFlight").getValueAsString();
boolean hasdata = false;

if(radioRaison.length() > 0 && !radioRaison.equals("Off")) {
    if(radioRaison.equals("NR")) {
        rvhi.setRaison(obtenirRaison(raisons, "NR"));
    }else if(radioRaison.equals("WX")) {
        rvhi.setRaison(obtenirRaison(raisons, "ME"));
    }else if(radioRaison.equals("US")) {
        rvhi.setRaison(obtenirRaison(raisons, "BR"));
    }
}
if(pdAcroForm.getField("import_Hmn0"+indexEnString).getValueAsString().length() > 0) 
{
    hasdata = true
}

pdf.close();

return hasdata;

现在，我的问题是对包含多个具有相同字段名称但字段中数据不同的相同页面的 pdf 执行相同的操作.我想遍历每个页面并调用相同的方法并检索每个页面上的字段数据.

Now, my problem is to do the same thing with a pdf that contains multiple identical pages with the same field names, but with different data in the fields. I would like to iterate through each pages and call the same method and retrieve the fields data on each page.

我使用下面的这段代码来遍历 pdf 的页面，但我不知道如何获取当前页面上的字段...我不知道如何从 PDPage 对象获取 acroform 字段?

I use this code below to iterate through pages of the pdf, but I don't know how to get the fields on the current page... I don't know how to get the acroform fields from the PDPage object?

PDPageTree nbPages = pdf.getPages();

if(nbPages.getCount() > 1) {
    for(PDPage page : nbPages) {
        ???? how to get fields Acroform from PDPage page ???
    }
}

预先感谢您的回复！

PDFBOX 2.0.18 - 如何遍历 PDF 页面并检索特定字段 [英] PDFBOX 2.0.18 - How to iterates through pages of a PDF and retrieve specific fields

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

PDFBOX 2.0.18 - 如何遍历 PDF 页面并检索特定字段 [英] PDFBOX 2.0.18 - How to iterates through pages of a PDF and retrieve specific fields

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭