
Check whether a PDF contains given text with PDFParser

I want to verify a PDF document using TestNG and PDFBox.

Is it possible to check whether the PDF contains a given text, something like this:

PDFParser parser = new PDFParser(stream);
parser.getDocument().contains("ABC");

Try the code below:

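  // Uses the PDFBox 1.8.x API (PDFParser(InputStream)); newer PDFBox versions load documents differently.
  // Imports: java.io.BufferedInputStream, java.net.URL, org.testng.Assert,
  //          org.apache.pdfbox.pdfparser.PDFParser, org.apache.pdfbox.pdmodel.PDDocument,
  //          org.apache.pdfbox.util.PDFTextStripper
  public void readPdf() throws Exception {
    URL testUrl = new URL("http://www.axmag.com/download/pdfurl-guide.pdf");

    BufferedInputStream testFile = new BufferedInputStream(testUrl.openStream());
    PDFParser parser = new PDFParser(testFile);
    parser.parse();
    PDDocument document = parser.getPDDocument();
    try {
      // Extract the text of the whole document, then look for the expected phrase.
      String text = new PDFTextStripper().getText(document);
      Assert.assertTrue(text.contains("Open the setting.xml, you can see it is like this"));
    } finally {
      // Release the document and the underlying stream even if the assertion fails.
      document.close();
      testFile.close();
    }
  }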

Download the libraries from https://pdfbox.apache.org/index.html
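One thing to watch with a plain contains() check: the text extracted by PDFTextStripper keeps the line breaks of the PDF layout, so a phrase that wraps across two lines in the document may not match even though it is there. A minimal sketch of a workaround, assuming it is acceptable to treat any run of whitespace as a single space. The class PdfTextCheck and its methods are hypothetical helpers written for illustration, not part of PDFBox or TestNG:

```java
import java.util.regex.Pattern;

// Hypothetical helper (not a PDFBox class): compares text after collapsing whitespace.
final class PdfTextCheck {
    private static final Pattern WHITESPACE = Pattern.compile("\\s+");

    // Collapse every run of spaces, tabs and line breaks into one space and trim the ends.
    static String normalize(String text) {
        return WHITESPACE.matcher(text).replaceAll(" ").trim();
    }

    // True if 'expected' occurs in 'extracted' once both have been normalized.
    static boolean containsNormalized(String extracted, String expected) {
        return normalize(extracted).contains(normalize(expected));
    }

    public static void main(String[] args) {
        // Stand-in for text returned by PDFTextStripper, with a line break inside the phrase.
        String extracted = "Open the setting.xml,\nyou can see  it is like this";
        String expected = "Open the setting.xml, you can see it is like this";

        System.out.println(extracted.contains(expected));            // false
        System.out.println(containsNormalized(extracted, expected)); // true
    }
}
```

In the test you could then write Assert.assertTrue(PdfTextCheck.containsNormalized(text, "...")) in place of the direct contains() call, where text is the string returned by PDFTextStripper.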
