
How to optimize indexing of a large number of DB records using Zend_Lucene and Zend_Paginator

So I have this script that is deployed on a host and run via cron; it indexes all the records in a database table. The index is later used both by the front end of the site and by back-end operations.

After the operation, the index is about 3-4 MB.

The problem is that it consumes a lot of resources (a CPU load of 30+ and a good chunk of memory) and slows the machine down. My question is how to optimize the operation described below:

First, a select query is built using the Zend Framework API. That query is passed to the Zend_Paginator factory, which returns a paginator I use to limit how many items are indexed per batch, so the script never iterates over too many rows at once. The script loops over the current page's items with a foreach; when it reaches the end of the page, it fetches the items for the next page and starts over (see the sketch below).
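For reference, here is a minimal sketch of the kind of batch-indexing loop described above, using Zend Framework 1. The table name, column names, page size, and index path are assumptions for illustration; the actual query and batch size in the cron script will differ.

```php
<?php
// Sketch of a paginated DB-to-Lucene indexing run (Zend Framework 1).
// Table, columns, page size, and paths are assumed for the example.

require_once 'Zend/Db.php';
require_once 'Zend/Paginator.php';
require_once 'Zend/Search/Lucene.php';

$db = Zend_Db::factory('Pdo_Mysql', array(
    'host'     => 'localhost',
    'username' => 'user',
    'password' => 'secret',
    'dbname'   => 'mydb',
));

// Build the select query with the Zend_Db API.
$select = $db->select()->from('articles', array('id', 'title', 'body'));

// Wrap it in a paginator so only one page of rows is fetched at a time.
$paginator = Zend_Paginator::factory($select);
$paginator->setItemCountPerPage(100);

// Create (or overwrite) the Lucene index on disk.
$index = Zend_Search_Lucene::create('/path/to/index');

$pages = count($paginator); // total number of pages
for ($page = 1; $page <= $pages; $page++) {
    $paginator->setCurrentPageNumber($page);

    foreach ($paginator as $row) {
        $doc = new Zend_Search_Lucene_Document();
        $doc->addField(Zend_Search_Lucene_Field::UnIndexed('db_id', $row['id']));
        $doc->addField(Zend_Search_Lucene_Field::Text('title', $row['title']));
        $doc->addField(Zend_Search_Lucene_Field::UnStored('body', $row['body']));
        $index->addDocument($doc);
    }
}

$index->commit();
$index->optimize();
```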

I suspect this overhead is caused by Zend_Lucene, but I have no idea how it could be improved.

See my answer to "Can I predict how large my Zend Framework index will be?"

I tested Zend_Search_Lucene versus Apache Lucene (the Java version). In my test, the Java product indexed 1.5 million documents about 300x faster than the PHP product.

You'll be much happier using Apache Solr (a search server built on Apache Lucene, typically run in a servlet container such as Tomcat). Solr includes a tool called the DataImportHandler that pulls data directly from a JDBC data source.

Use the PECL Solr extension to communicate with Solr from PHP. If you can't install that PHP extension, use cURL, which should be available in default installations of PHP.
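For example, here is a minimal sketch of sending one document to Solr with the PECL Solr extension; the hostname, port, core path, and field names are assumptions and depend on your Solr setup.

```php
<?php
// Sketch: indexing one document via the PECL Solr extension.
// Hostname, port, core path, and field names are assumed values.

$client = new SolrClient(array(
    'hostname' => 'localhost',
    'port'     => 8983,
    'path'     => 'solr/mycore', // path to the Solr core
));

$doc = new SolrInputDocument();
$doc->addField('id', '123');
$doc->addField('title', 'Example title');
$doc->addField('body', 'Example body text pulled from the database.');

$client->addDocument($doc);
$client->commit();

// Without the extension, the same update can be performed by POSTing
// documents to Solr's /update endpoint with cURL.
```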

