How to get the (x, y, width, height) of any given word in a PDF using Java

Question

I need to get the x, y, width, and height of a given word in a PDF, so that later, when parsing files of the same type, I can fetch the value directly from those coordinates. How can I get the position of a word in a PDF using Java?

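// stripper is assumed to be a PDFBox PDFTextStripperByArea, pdDocument an open PDDocument
PDFTextStripperByArea stripper = new PDFTextStripperByArea();
// These coordinates are hard-coded; I need to obtain them for any particular word
Rectangle rect = new Rectangle(451, 125, 100, 1);
stripper.addRegion("class1", rect);
stripper.extractRegions(pdDocument.getPage(0));
System.out.println("stripper " + stripper.getTextForRegion("class1").trim());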

Answer 1

I think you can use Apache's PDFBox API and follow the advice in this similar question, which is specifically about writing the code you need with that API.
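As a rough sketch of the idea (not the code from the linked question), the bounding box of a word can be built by merging the boxes of its individual characters. With PDFBox 2.x, the per-character positions would typically come from overriding `PDFTextStripper.writeString(String, List<TextPosition>)` and reading `getUnicode()`, `getXDirAdj()`, `getYDirAdj()`, `getWidthDirAdj()` and `getHeightDir()` from each `TextPosition`. The `Glyph` class and `WordLocator.locate` below are hypothetical names standing in for that data, so the example runs with the JDK alone. It assumes `y` is the top edge of each character; as far as I know `getYDirAdj()` reports the baseline, so you would probably subtract the height first.

```java
import java.awt.geom.Rectangle2D;
import java.util.ArrayList;
import java.util.List;

final class WordLocator {

    /** Hypothetical stand-in for one positioned character (e.g. a PDFBox TextPosition). */
    static final class Glyph {
        final String text;
        final double x, y, width, height;

        Glyph(String text, double x, double y, double width, double height) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
        }
    }

    /** Returns one bounding box per occurrence of word in a run of glyphs. */
    static List<Rectangle2D> locate(List<Glyph> glyphs, String word) {
        List<Rectangle2D> hits = new ArrayList<>();
        if (word.isEmpty()) {
            return hits;
        }
        // Concatenate the glyph text, remembering which glyph produced each char.
        StringBuilder line = new StringBuilder();
        List<Integer> owner = new ArrayList<>();
        for (int i = 0; i < glyphs.size(); i++) {
            String t = glyphs.get(i).text;
            line.append(t);
            for (int k = 0; k < t.length(); k++) {
                owner.add(i);
            }
        }
        // For every match, merge the boxes of the glyphs that the match covers.
        int from = 0;
        int start;
        while ((start = line.indexOf(word, from)) >= 0) {
            int first = owner.get(start);
            int last = owner.get(start + word.length() - 1);
            Rectangle2D box = null;
            for (int i = first; i <= last; i++) {
                Glyph g = glyphs.get(i);
                Rectangle2D r = new Rectangle2D.Double(g.x, g.y, g.width, g.height);
                box = (box == null) ? r : box.createUnion(r);
            }
            hits.add(box);
            from = start + 1;
        }
        return hits;
    }
}
```

Each returned `Rectangle2D` could then be passed to `stripper.addRegion(...)` as in the question, since `PDFTextStripperByArea.addRegion` accepts a `Rectangle2D`. The sketch matches plain substrings, so restricting hits to whole words or handling words split across lines would need extra checks.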

Answered 2019-04-10 18:55:09
