简体   繁体   中英

when is using JDBC with lucene indexes a good idea?

I'm implementing a lucene (Compass) based search engine for our application and we will be deploying to a vmware environment. The client tells me that the I/O performance is so bad on the the vmware host that JDBC storage for lucene would be preferred over shared storage of the index files. I'm surprised by this.

So I'm wondereing if you have a choice, what are some pros and cons of using a database or shared storage for lucene indexes in a clustered environment? Does it depend on the expected number of index writes?

I would ask for some data to back up that assertion. Don't databases use I/O as well? Or would your JDBC solution connect to a non-virtualized database instance?

How often do you re-index with Lucene? I don't see how this could be a problem if you only do it once a day using a separate batch job.

I'd prefer Lucene to JDBC because the solution demanded search capability. Try doing free-form searches with SQL and LIKE clauses sometime. That would seem far more inefficient.

If it's a big concern, try a small benchmark.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM