问题描述
我需要在我的程序中将 PDF 表解析为对象,但是我在从 PDF 文件读取表时遇到问题,因为很少有列是空的,但其他列是满的,所以 here is the problem:
-
逐行 (PDFTextStripper)
for (int i = START_PAGE; i <= numberOfPages; i++) { stripper.setStartPage(i); stripper.setEndPage(i); String page = stripper.getText(document); String[] lines = page.split(stripper.getLineseparator()); int rowToStart = i == START_PAGE ? ROW_TO_START_ON_START_PAGE : ROW_TO_START_ON_NEXT_PAGE; for (int j = rowToStart; j < lines.length - 1; j++) { log.info(lines[j]); }
-
通过使用 PDFTextStripperByArea
Rectangle postCodesColumn = new Rectangle(0,40,55,780); stripper.addRegion("postCodes",postCodesColumn); stripper.addRegion("city",cityColumn); stripper.addRegion("street",streetColumn); stripper.addRegion("streetNumber",streetNumberColumn); stripper.addRegion("borough",boroughColumn); stripper.addRegion("county",countyColumn); stripper.addRegion("voivodeship",voivodeshipColumn); stripper.extractRegions(document.getPage(i));
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)