我正在使用pdfbox-app-2.0.0-RC3但在PDF解析器中使用RndomAccessFile時仍然出現錯誤

Question

-您可以通過以下鏈接查看示例： http : //radixcode.com/pdfbox-example-code-how-to-extract-text-from-pdf-file-with-java/

import java.io.IOException;

public class JavaPDFTest {

    public static void main(String[] args) throws IOException {

       PDFManager pdfManager = new PDFManager();
       pdfManger.setFilePath("E:\test.pdf");
       System.out.println(pdfManager.ToText());       
    }    
}

import java.io.File;
import java.io.IOException;
import org.apache.pdfbox.cos.COSDocument;
import org.apache.pdfbox.io.RandomAccessFile;
import org.apache.pdfbox.pdfparser.PDFParser;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.text.PDFTextStripper;

public class PDFManager {

   private PDFParser parser;
   private PDFTextStripper pdfStripper;
   private PDDocument pdDoc ;
   private COSDocument cosDoc ;

   private String Text ;
   private String filePath;
   private File file;

    public PDFManager() {

    }
   public String ToText() throws IOException
   {
       this.pdfStripper = null;
       this.pdDoc = null;
       this.cosDoc = null;

       file = new File(filePath);
       parser = new PDFParser(new RandomAccessFile(file,"r")); // update for PDFBox V 2.0

       parser.parse();
       cosDoc = parser.getDocument();
       pdfStripper = new PDFTextStripper();
       pdDoc = new PDDocument(cosDoc);
       pdDoc.getNumberOfPages();
       pdfStripper.setStartPage(1);
       pdfStripper.setEndPage(10);

       // reading text from page 1 to 10
       // if you want to get text from full pdf file use this code
       // pdfStripper.setEndPage(pdDoc.getNumberOfPages());

       Text = pdfStripper.getText(pdDoc);
       return Text;
   }

    public void setFilePath(String filePath) {
        this.filePath = filePath;
    }
}

錯誤

線程“主”中的異常java.lang.ClassCastException：java.io.RandomAccessFile無法轉換為org.apache.pdfbox.io.RandomAccessRead
在aechaec.PDFManager.ToText（PDFManager.java:43）
在aechaec.AechAEC.main（AechAEC.java:25）
Java結果：1

是由安全特權引起的嗎？ 因為我在Mac上使用Netbeans？

Answer 1

周圍有許多過時的示例，它們可能會起作用，也可能不會起作用。 請替換此代碼

file = new File(filePath);
parser = new PDFParser(new RandomAccessFile(file,"r"));
parser.parse();
cosDoc = parser.getDocument();
pdfStripper = new PDFTextStripper();
pdDoc = new PDDocument(cosDoc);

這段代碼：

pdDoc = PDDocument.load(new File(filePath));
pdfStripper = new PDFTextStripper();

並更新到2.0的發行版本。

我正在使用pdfbox-app-2.0.0-RC3但在PDF解析器中使用RndomAccessFile時仍然出現錯誤

問題描述

1 個解決方案

解決方案1
10 已采納 2016-04-09 18:35:30

我正在使用pdfbox-app-2.0.0-RC3但在PDF解析器中使用Rndo​​mAccessFile時仍然出現錯誤

問題描述

1 個解決方案

解決方案1 10 已采納 2016-04-09 18:35:30

我正在使用pdfbox-app-2.0.0-RC3但在PDF解析器中使用RndomAccessFile時仍然出現錯誤

解決方案1
10 已采納 2016-04-09 18:35:30