
How to search through thousands of files for text efficiently in real time

I'm working on refactoring a document storage service's site to move from a proprietary storage system to SQL. Everything is going fairly well, but I need a way to search our repository for specific strings of text. We use a multitude of file types (.xls, .xlsx, .doc, .txt, etc.). They're displayed to the user by first converting them to PDF, rebuilt line by line using PDFSharp.

Speed isn't a concern for viewing or searching a single file, but I have concerns about scalability. I was able to build a functioning text search by copying our conversion process and hooking into it, but I'm fairly sure this won't work for searching a customer's entire document list (thousands and thousands of documents). If they were all a uniform file type it might be easier, but they aren't.

Is there an efficient way to do this of which I am unaware?

EDIT: The documents are stored on the server and referenced via document URLs in the DB

My recommendation is to build an index, either in SQL or in a file, that maps each file to all the search terms of interest it contains. Build it once, when a document is uploaded or converted, so a search becomes a lookup instead of a scan of the whole corpus.
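
A minimal sketch of that idea in C#, assuming the plain text is captured during your existing file-to-PDF conversion pass (the DocumentText type, its field names, and the tokenizer below are hypothetical placeholders, not anything from your pipeline):

    using System;
    using System.Collections.Generic;
    using System.Linq;
    using System.Text.RegularExpressions;

    // Hypothetical record: one per stored document, holding the text
    // already extracted during the file-to-PDF conversion step.
    public record DocumentText(int DocumentId, string Text);

    public class InvertedIndex
    {
        // term -> set of IDs of documents containing that term
        private readonly Dictionary<string, HashSet<int>> _index =
            new(StringComparer.OrdinalIgnoreCase);

        // Naive tokenizer: split on non-word characters and drop
        // one-character fragments; adjust for your content.
        private static IEnumerable<string> Tokenize(string text) =>
            Regex.Split(text, @"\W+").Where(t => t.Length > 1);

        // Called once per document at upload/conversion time,
        // not at search time.
        public void Add(DocumentText doc)
        {
            foreach (var term in Tokenize(doc.Text))
            {
                if (!_index.TryGetValue(term, out var ids))
                    _index[term] = ids = new HashSet<int>();
                ids.Add(doc.DocumentId);
            }
        }

        // A search is now a dictionary lookup, so its cost does not
        // grow with the number of documents in the corpus.
        public IReadOnlyCollection<int> Search(string term) =>
            _index.TryGetValue(term, out var ids)
                ? (IReadOnlyCollection<int>)ids
                : Array.Empty<int>();
    }

At query time, index.Search("invoice") returns document IDs you can resolve to URLs through the DB. To survive restarts, the same (term, document ID) pairs can be persisted to a two-column SQL table with an index on the term column, which also lets you join hits directly against your document-URL table.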
