Searching through PDF text with Node.js

Original post 2018-08-14 18:59:04 5 1 mysql/ node.js/ google-app-engine/ pdf/ pdftextstream

I have thousands of searchable PDFs, some of which are up to 1 GB with over 2000 pages. I need to be able to search for a text string in these files from a Node.js app.

Right now, files are stored in a Google Cloud Storage bucket.

What's the best way to do this?

Some options:

Extract the text from the PDF files into MySQL using something like the npm package pdf-text-extract, then use MySQL queries to search for text strings.
Search the PDF files directly using some npm package.

Am I completely off? Is there a better way?

1 answer

There are dedicated text search libraries out there, like this one, or this. Most likely you'd need to extract plain text from each PDF, then save and index it. After that you'll be able to run search queries against the index. Setting up a database for this particular task may be overkill.
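To make the "extract, save, index, then search" idea concrete, here is a minimal sketch of an in-memory inverted index in plain Node.js. It assumes the text has already been extracted page by page (for example with a tool like pdf-text-extract) and does not show the extraction itself. The names `PageIndex`, `addPage`, and `search` are hypothetical and made up for this illustration; they are not the API of any of the libraries mentioned above. A real setup for thousands of large PDFs would likely persist the index and page text to disk rather than keep everything in memory.

```javascript
// Minimal in-memory inverted index over already-extracted PDF page text.
// PageIndex, addPage and search are hypothetical names used for illustration.
class PageIndex {
  constructor() {
    this.postings = new Map(); // term -> Set of "file#page" keys
    this.pages = new Map();    // "file#page" -> { file, page, text }
  }

  // Lowercase the text and split on anything that is not a letter or digit.
  static tokenize(text) {
    return text.toLowerCase().split(/[^\p{L}\p{N}]+/u).filter(Boolean);
  }

  // Register one page of extracted text under its file name and page number.
  addPage(file, page, text) {
    const key = `${file}#${page}`;
    this.pages.set(key, { file, page, text });
    for (const term of PageIndex.tokenize(text)) {
      if (!this.postings.has(term)) this.postings.set(term, new Set());
      this.postings.get(term).add(key);
    }
  }

  // Find pages containing every query term, then keep only those pages
  // whose text contains the query as a case-insensitive substring.
  search(query) {
    const terms = PageIndex.tokenize(query);
    if (terms.length === 0) return [];
    const sets = terms.map((t) => this.postings.get(t) || new Set());
    sets.sort((a, b) => a.size - b.size); // start from the rarest term
    let candidates = [...sets[0]];
    for (const s of sets.slice(1)) {
      candidates = candidates.filter((key) => s.has(key));
    }
    const needle = query.toLowerCase();
    return candidates
      .map((key) => this.pages.get(key))
      .filter((p) => p.text.toLowerCase().includes(needle))
      .map(({ file, page }) => ({ file, page }));
  }
}
```

With pages added via `addPage('a.pdf', 1, text)`, a call such as `search('net income')` returns a list of `{ file, page }` objects, which is enough to point a user at the right page of the right PDF. The index lookup narrows the candidates cheaply, so the more expensive substring check only runs on pages that contain all of the query terms.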





solution1 0 2018-08-14 19:34:45