简体   繁体   中英

How to get the number of pages of a .PDF uploaded by user?

我有一个文件输入,在“上传”之前,我需要计算 JAVASCRIPT 中该 .pdf 的页数(例如 JQuery ...)

In case you use pdf.js you may reference an example on github ('.../examples/node/getinfo.js') with following code that prints number of pages in a pdf file.

const pdfjsLib = require('pdfjs-dist');
...
pdfjsLib.getDocument(pdfPath).then(function (doc) {
    var numPages = doc.numPages;
    console.log('# Document Loaded');
    console.log('Number of Pages: ' + numPages);
})

and a pure javascript solution:

var input = document.getElementById("files");
var reader = new FileReader();
reader.readAsBinaryString(input.files[0]);
reader.onloadend = function(){
    var count = reader.result.match(/\/Type[\s]*\/Page[^s]/g).length;
    console.log('Number of Pages:',count );
}

As has been stated in the other answers, something like pdf.js is be what you are looking for. I've taken a look at the API and it does include a numPages() function to return the total number of pages. It also seems to count pages for me when viewing the demo page from Mozilla.

It depends if you are able to use modern browsers and experimental technology for your solution. pdf.js is very impressive, but it is still experimental according to the github page .

If you are able to count the pages on the server after uploading, then you should look atpdftools or similar.

Something like pdftools --countpages is what you are looking for

I think the API has changed a little since Tracker1 posted an answer. I tried Tracker1's code and saw this error:

Uncaught TypeError: pdfjsLib.getDocument(...).then is not a function

A small change fixes it:

const pdfjsLib = require('pdfjs-dist');
...
pdfjsLib.getDocument(pdfPath).promise.then(function (doc) {
    var numPages = doc.numPages;
    console.log('# Document Loaded');
    console.log('Number of Pages: ' + numPages);
}

You could also use pdf-lib .

You will need to read the file from the input field and then make use of pdf-lib to get the number of pages. The code would be like this:

import { PDFDocument } from 'pdf-lib';

...

const readFile = (file) => {

  return new Promise((resolve, reject) => {

    const reader = new FileReader();

    reader.onload = () => resolve(reader.result);
    reader.onerror = error => reject(error);

    reader.readAsArrayBuffer(file);
  });
}

const async getNumPages = (file) => {

  const arrayBuffer = await readFile(file);

  const pdf = await PDFDocument.load(arrayBuffer);

  return pdf.getPages();
}

And then just get the number of pages of the attached file with:

const numPages = await getNumPages(input.files[0]);

being input the variable which stores the reference to the DOM element of the file input.

In typescript class using Pdf-lib I use the following.

 // getPAGE COUNT: async getPageCount(formUrl: any): Promise<number>{ const LogPdfFields = [] as any[]; const formPdfBytes = await fetch(formUrl).then((res) => res.arrayBuffer()); const pdfDoc = await PDFDocument.load(formPdfBytes); const pageCount = pdfDoc.getPageCount(); return pageCount; }

Call as a promise

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM