iText 7 HTML到Pdf转换并将外部文件链接到生成的pdf 关于revision 2 of your question 关于revision 1 of your question

问题描述

在合并从IText生成的两个PDF时遇到问题。 我是iText7的新手 我正在从html创建一个pdf,并使用excel(.xls)作为嵌入式文档创建另一个pdf。 我要合并两个文件

基本上我想从html生成PDF,然后将excel文档附加到它,然后从这两个pdf输出组合的html outPutStream。

下面是我正在使用的代码

    ByteArrayOutputStream htmlToPdfContent = new ByteArrayOutputStream();
    PdfWriter writer = new PdfWriter(htmlToPdfContent);
    PdfDocument pdf = new PdfDocument(writer);
    pdf.setTagged();
    PageSize pageSize = PageSize.A4.rotate();
    pdf.setDefaultPageSize(pageSize);
    ConverterProperties properties = new ConverterProperties();
    HtmlConverter.convertToPdf(htmlContent,pdf,properties);

    FileUtils.cleanDirectory(new File(outputDir));

    ByteArrayOutputStream pdfResult = new ByteArrayOutputStream();
    PdfWriter writerResult = new PdfWriter(pdfResult);
    PdfDocument pdfDocResult = new PdfDocument(writerResult);

    PdfReader reader = new PdfReader(new ByteArrayInputStream(htmlToPdfContent.toByteArray()));
    PdfDocument pdfDoc = new PdfDocument(reader);
    pdfDoc.copyPagesTo(1,pdfDoc.getNumberOfPages(),pdfDocResult);

    ByteArrayOutputStream pdfAttach = new ByteArrayOutputStream();
    PdfDocument pdfLaunch = new PdfDocument(new PdfWriter(pdfAttach));
    Rectangle rect = new Rectangle(36,700,100,100);
    byte[] embeddedFileContentBytes = Files.readAllBytes(Paths.get(excelPath));
    PdfFileSpec fs = PdfFileSpec.createEmbeddedFileSpec(pdfLaunch,embeddedFileContentBytes,null,"test.xlsx",null);
    PdfAnnotation attachment = new PdfFileAttachmentAnnotation(rect,fs)
            .setContents("Click me");
    pdfLaunch.addNewPage().addAnnotation(attachment);

    PdfDocument appliedChanges = new PdfDocument(new PdfReader(new ByteArrayInputStream(pdfAttach.toByteArray())));

    appliedChanges.copyPagesTo(1,appliedChanges.getNumberOfPages(),pdfDocResult);
    try(OutputStream outputStream = new FileOutputStream(dest)) {
        pdfResult.writeto(outputStream);
    }

这引发异常

13:56:05.724 [main] ERROR com.itextpdf.kernel.pdf.PdfReader - Error occurred while reading cross reference table. Cross reference table will be rebuilt.
com.itextpdf.io.IOException: Error at file pointer 19,272.
    at com.itextpdf.io.source.PdfTokenizer.throwError(PdfTokenizer.java:678)
    at com.itextpdf.kernel.pdf.PdfReader.readXrefSection(PdfReader.java:801)
    at com.itextpdf.kernel.pdf.PdfReader.readXref(PdfReader.java:774)
    at com.itextpdf.kernel.pdf.PdfReader.readPdf(PdfReader.java:538)
    at com.itextpdf.kernel.pdf.PdfDocument.open(PdfDocument.java:1818)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:238)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:221)
    at com.mediaocean.prisma.order.command.infrastructure.pdf.itext.PdfAttachmentLaunch.main(PdfAttachmentLaunch.java:76)
Caused by: com.itextpdf.io.IOException: xref subsection not found.
    ... 8 common frames omitted
Exception in thread "main" com.itextpdf.kernel.PdfException: Trailer not found.
    at com.itextpdf.kernel.pdf.PdfReader.rebuildXref(PdfReader.java:1064)
    at com.itextpdf.kernel.pdf.PdfReader.readPdf(PdfReader.java:543)
    at com.itextpdf.kernel.pdf.PdfDocument.open(PdfDocument.java:1818)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:238)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:221)
    at com.mediaocean.prisma.order.command.infrastructure.pdf.itext.PdfAttachmentLaunch.main(PdfAttachmentLaunch.java:88)
13:56:05.773 [main] ERROR com.itextpdf.kernel.pdf.PdfReader - Error occurred while reading cross reference table. Cross reference table will be rebuilt.
com.itextpdf.io.IOException: PDF startxref not found.
    at com.itextpdf.io.source.PdfTokenizer.getStartxref(PdfTokenizer.java:262)
    at com.itextpdf.kernel.pdf.PdfReader.readXref(PdfReader.java:753)
    at com.itextpdf.kernel.pdf.PdfReader.readPdf(PdfReader.java:538)
    at com.itextpdf.kernel.pdf.PdfDocument.open(PdfDocument.java:1818)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:238)
    at com.itextpdf.kernel.pdf.PdfDocument.<init>(PdfDocument.java:221)
    at com.mediaocean.prisma.order.command.infrastructure.pdf.itext.PdfAttachmentLaunch.main(PdfAttachmentLaunch.java:88)

请告知。在此先感谢!

解决方法

关于revision 2 of your question

您更改代码的方式与我对问题的第一次修订的回答中所建议的方式不同,现在您转换为以前未使用的PdfDocument pdf,而不是直接转换为ByteArrayOutputStream htmlToPdfContent

这实际上也可以解决该答案中标识的问题。因此,这里不再有例外:

PdfReader reader = new PdfReader(new ByteArrayInputStream(htmlToPdfContent.toByteArray()));
PdfDocument pdfDoc = new PdfDocument(reader);

相反,您现在可以在流程的更深处看到一个异常,

PdfDocument appliedChanges = new PdfDocument(new PdfReader(new ByteArrayInputStream(pdfAttach.toByteArray())));

原因很简单,您尚未关闭写入PdfDocument pdfLaunch的{​​{1}}。但是只有关闭才能最终确定输出流中的PDF。因此,添加ByteArrayOutputStream pdfAttach

close()

实际上,您很快又犯了同样的错误,不久之后,您将ByteArrayOutputStream pdfAttach = new ByteArrayOutputStream(); PdfDocument pdfLaunch = new PdfDocument(new PdfWriter(pdfAttach)); [...] pdfLaunch.addNewPage().addAnnotation(attachment); pdfLaunch.close(); //<==== added PdfDocument appliedChanges = new PdfDocument(new PdfReader(new ByteArrayInputStream(pdfAttach.toByteArray()))); 的内容存储到ByteArrayOutputStream pdfResult而没有关闭写入outputStream的{​​{1}}。因此,还可以在此处添加一个PdfDocument pdfDocResult调用:

pdfResult

关于revision 1 of your question

您将close用作两个不同的PDF生成器的目标,这两个appliedChanges.copyPagesTo(1,appliedChanges.getNumberOfPages(),pdfDocResult); pdfDocResult.close(); //<==== added try(OutputStream outputStream = new FileOutputStream(dest)) { pdfResult.writeTo(outputStream); } 通过ByteArrayOutputStream htmlToPdfContentPdfDocument pdf调用:

PdfWriter writer

这使HtmlConverter.convertToPdf的内容成为两者输出的大杂烩,尤其是无效的PDF。

由于您没有向ByteArrayOutputStream htmlToPdfContent = new ByteArrayOutputStream(); PdfWriter writer = new PdfWriter(htmlToPdfContent); PdfDocument pdf = new PdfDocument(writer); pdf.setTagged(); PageSize pageSize = PageSize.A4.rotate(); pdf.setDefaultPageSize(pageSize); ConverterProperties properties = new ConverterProperties(); HtmlConverter.convertToPdf(content,htmlToPdfContent,properties); 添加任何内容,因此可以安全地将其删除并将上述摘录减少为

htmlToPdfContent