如何在Java中使用iText从PDF文件中删除页眉和页脚 [英] How to remove headers and footers from PDF file using iText in Java

查看：820 发布时间：2021/2/9 19:49:22 java pdf itext

本文介绍了如何在Java中使用iText从PDF文件中删除页眉和页脚的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用PDF iText库将PDF转换为文本.

I am using the PDF iText library to convert PDF to text.

下面是我的代码，可使用Java将PDF转换为文本文件.

Below is my code to convert PDF to text file using Java.

public class PdfConverter {

/** The original PDF that will be parsed. */
public static final String pdfFileName = "jdbc_tutorial.pdf";
/** The resulting text file. */
public static final String RESULT = "preface.txt";

/**
 * Parses a PDF to a plain text file.
 * @param pdf the original PDF
 * @param txt the resulting text
 * @throws IOException
 */
public void parsePdf(String pdf, String txt) throws IOException {
    PdfReader reader = new PdfReader(pdf);
    PdfReaderContentParser parser = new PdfReaderContentParser(reader);
    PrintWriter out = new PrintWriter(new FileOutputStream(txt));

    TextExtractionStrategy strategy;
    for (int i = 1; i <= reader.getNumberOfPages(); i++) {
        strategy = parser.processContent(i, new SimpleTextExtractionStrategy());
        out.println(strategy.getResultantText());
        System.out.println(strategy.getResultantText());
    }
    out.flush();
    out.close();
    reader.close();
}

/**
 * Main method.
 * @param    args    no arguments needed
 * @throws IOException
 */
public static void main(String[] args) throws IOException {
    new PdfConverter().parsePdf(pdfFileName, RESULT);
}
}

以上代码可用于将PDF提取为文本.但是我的要求是忽略页眉和页脚，仅从PDF文件中提取内容.

The above code works for extracting PDF to text. But my requirement is to ignore header and footer and extract only content from PDF file.

如何在Java中使用iText从PDF文件中删除页眉和页脚 [英] How to remove headers and footers from PDF file using iText in Java

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

如何在Java中使用iText从PDF文件中删除页眉和页脚 [英] How to remove headers and footers from PDF file using iText in Java

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭