iText pdf在使用NOTO字体或Source Hans时不显示中文字符 [英] iText pdf not displaying Chinese characters when using NOTO fonts or Source Hans

查看:630
本文介绍了iText pdf在使用NOTO字体或Source Hans时不显示中文字符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在尝试使用NOTO字体(



并且在此代码中

  public void createPdf(String filename,boolean appearances,boolean font)
throws IOException,DocumentException {
// step 1
Document document = new Document();
//步骤2
PdfWriter writer = PdfWriter.getInstance(document,new FileOutputStream(filename));
//步骤3
document.open();
//步骤4
writer.getAcroForm()。setNeedAppearances(appearances);
TextField text = new TextField(writer,new Rectangle(36,806,559,780),description);
text.setOptions(TextField.MULTILINE);
if(font){
BaseFont unicode =
BaseFont.createFont(c:/windows/fonts/arialuni.ttf,BaseFont.IDENTITY_H,BaseFont.NOT_EMBEDDED);
text.setExtensionFont(BaseFont.createFont());
ArrayList< BaseFont> list = new ArrayList< BaseFont>();
list.add(unicode);
text.setSubstitutionFonts(list);
BaseFont f =(BaseFont)text.getSubstitutionFonts()。get(0);
System.out.println(f.getPostscriptFontName());

}
text.setText(TEXT);

writer.addAnnotation(text.getTextField());
//步骤5
document.close();
}

我用C代替c:/windows/fonts/arialuni.ttf: /temp/fonts/NotoSansCJKtc-Thin.otf,我没看到汉字。现在要转换的文字是

  public static final String TEXT =这些是'Hero'的主角,张电影Yimou:\ n
+\ u7121 \ u540d(无名),\ u6b98 \ u528d(断剑),
+\ u98db\ u96ea(飞雪) ,\ u5982 \ u6708(月亮),
+\ u79e6 \ u738b(国王)和\ u9577 \ u7a7a(天空)。;


解决方案

显然你使用了错误的字体。我已从您发布的链接下载了字体。您使用的是NotoSerif-Bold.ttf,不支持支持中文的字体。但是,ZIP文件还包含字体名称中带有CJK的字体。 如您所指的网站所述,CJK代表中文,日文和韩文。使用其中一种CJK字体,您就可以在PDF中生成中文文本。



看一下,但没有发现错误:





[*] 更新:如果您使用iText群集NV未知的版本iText 4.2.x,则可以重现此问题。请仅使用高于5的iText版本。


I am trying to use NOTO fonts (https://www.google.com/get/noto/) to display Chinese characters. Here is my sample code,a modified sample code from iText.

    public void createPdf(String filename) throws IOException, DocumentException {

            Document document = new Document();
            PdfWriter.getInstance(document, new FileOutputStream(filename));
            document.open();

            //This is simple English Font
            FontFactory.register("c:/temp/fonts/NotoSerif-Bold.ttf", "my_nato_font");
            Font myBoldFont = FontFactory.getFont("my_nato_font");
            BaseFont bf = myBoldFont.getBaseFont();
            document.add(new Paragraph(bf.getPostscriptFontName(), myBoldFont));


            //This is Chinese font


            //Option 1 :
           Font myAdobeTypekit = FontFactory.getFont("SourceHanSansSC-Regular", BaseFont.IDENTITY_H, BaseFont.NOT_EMBEDDED);

         //Option 2 :
         /*FontFactory.register("C:/temp/AdobeFonts/source-han-sans-1.001R/OTF/SimplifiedChinese/SourceHanSansSC-Regular.otf", "my_hans_font");
         Font myAdobeTypekit = FontFactory.getFont("my_hans_font", BaseFont.IDENTITY_H, BaseFont.EMBEDDED);*/



            document.add(Chunk.NEWLINE);
            document.add(new Paragraph("高興", myAdobeTypekit));
            document.add(Chunk.NEWLINE);

            //simplified chinese
            document.add(new Paragraph("朝辞白帝彩云间", myAdobeTypekit));
            document.add(Chunk.NEWLINE);

            document.add(new Paragraph("高兴", myAdobeTypekit));
            document.add(new Paragraph("The Source Han Sans Traditional Chinese ", myAdobeTypekit));


            document.close();
        }

I have downloaded the fonts files on my machine. I am using two approaches

  1. To use the equivalent font family in Adobe

  2. Embed the otf file in pdf

Using approach 1, I would expect the Chinese characters to be displayed in pdf but English text is displayed and it is blank for Chinese characters.

Using approach 2, when I try embedding the fonts with pdf, which is not the path I would like to take, there is error in opening pdf.

Update : If I look at this example http://itextpdf.com/examples/iia.php?id=214

and in this code

public void createPdf(String filename, boolean appearances, boolean font)
    throws IOException, DocumentException {
    // step 1
    Document document = new Document();
    // step 2
    PdfWriter writer = PdfWriter.getInstance(document, new FileOutputStream(filename));
    // step 3
    document.open();
    // step 4
    writer.getAcroForm().setNeedAppearances(appearances);
    TextField text = new TextField(writer, new Rectangle(36, 806, 559, 780), "description");
    text.setOptions(TextField.MULTILINE);
    if (font) {
        BaseFont unicode =
            BaseFont.createFont("c:/windows/fonts/arialuni.ttf", BaseFont.IDENTITY_H, BaseFont.NOT_EMBEDDED);
        text.setExtensionFont(BaseFont.createFont());
        ArrayList<BaseFont> list = new ArrayList<BaseFont>();
        list.add(unicode);
        text.setSubstitutionFonts(list);
        BaseFont f= (BaseFont)text.getSubstitutionFonts().get(0);
        System.out.println(f.getPostscriptFontName());

    }
    text.setText(TEXT);

    writer.addAnnotation(text.getTextField());
    // step 5
    document.close();
}

I substitute, c:/windows/fonts/arialuni.ttf with C:/temp/fonts/NotoSansCJKtc-Thin.otf , I do not see the Chinese characters. The text to convert now is

public static final String TEXT = "These are the protagonists in 'Hero', a movie by Zhang Yimou:\n"
    + "\u7121\u540d (Nameless), \u6b98\u528d (Broken Sword), "
    + "\u98db\u96ea (Flying Snow), \u5982\u6708 (Moon), "
    + "\u79e6\u738b (the King), and \u9577\u7a7a (Sky).";

解决方案

Clearly you are using the wrong font. I have downloaded the fonts from the link you posted. You are using NotoSerif-Bold.ttf, a font that does not support Chinese. However, the ZIP file also contains fonts with CJK in the font name. As described on the site you refer to, CJK stands for Chinese, Japanese and Korean. Use one of those CJK fonts and you'll be able to product Chinese text in your PDF.

Take a look at the NotoExample in which I use one of the fonts from the ZIP file you refer to. It creates a PDF that looks like this:

This is the code I used:

public static final String FONT = "resources/fonts/NotoSansCJKsc-Regular.otf";
public static final String TEXT = "These are the protagonists in 'Hero', a movie by Zhang Yimou:\n"
    + "\u7121\u540d (Nameless), \u6b98\u528d (Broken Sword), "
    + "\u98db\u96ea (Flying Snow), \u5982\u6708 (Moon), "
    + "\u79e6\u738b (the King), and \u9577\u7a7a (Sky).";
public static final String CHINESE = "\u5341\u950a\u57cb\u4f0f";
public static final String JAPANESE = "\u8ab0\u3082\u77e5\u3089\u306a\u3044";
public static final String KOREAN = "\ube48\uc9d1";

public void createPdf(String dest) throws IOException, DocumentException {
    Document document = new Document();
    PdfWriter.getInstance(document, new FileOutputStream(DEST));
    document.open();
    Font font = FontFactory.getFont(FONT, BaseFont.IDENTITY_H, BaseFont.EMBEDDED);
    Paragraph p = new Paragraph(TEXT, font);
    document.add(p);
    document.add(new Paragraph(CHINESE, font));
    document.add(new Paragraph(JAPANESE, font));
    document.add(new Paragraph(KOREAN, font));
    document.close();
}

You claim that Adobe Reader XI doesn't show the Chinese glyphs, but instead shows a "Cannot extract the embedded Font" message. I can not reproduce this [*]. I have even used Preflight in Adobe Acrobat as indicated here, but no errors were found:

[*] Update: this problem can be reproduced if you use iText 4.2.x, a version that was released by somebody unknown to iText Group NV. Please use iText versions higher than 5 only.

这篇关于iText pdf在使用NOTO字体或Source Hans时不显示中文字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆