使用 Phantomjs/Pjscrape 抓取多個頁面

Question

試圖抓取多個頁面，但無法讓 urlid 數組在 pjscrape .js 文件中工作。

我很確定我可能會犯一個新手錯誤，但我會很感激一些幫助。 謝謝：）

pjs.config({

    timeoutInterval: 6000,
    timeoutLimit: 10000,

})

pjs.addSuite({
    // single URL or array
    url: abolaURLs,
    scraper: function(){
        var abolaURLs = [366762,366764,366763];
        for (var i = 0; i<abolaURLs.length; i++) {
            abolaURLs[i] = 'http://abola.pt/nnh/ver.aspx?id=' + abolaURLs[i];
        };
        var results[];
        var cenas1 = $('div#a5g2').text();
        var cenas2 = $('span#noticiatext').text();
        var cenas3 = $('div#a5x').text();
        results.push(cenas1, cenas2, cenas3);
        return results;
    }
});

Answer 1

這對你有用：

var abolaURLs = [366762,366764,366763];

for (var i = 0; i < abolaURLs.length; i++) {
    abolaURLs[i] = 'http://abola.pt/nnh/ver.aspx?id=' + abolaURLs[i];
};

pjs.addSuite({
    url: abolaURLs,
    scraper: function() {
            var results = []; // !! you have the wrong array declaration result[]
            var cenas1 = $('div#a5g2').text();
            var cenas2 = $('span#noticiatext').text();
            var cenas3 = $('div#a5x').text();
            results.push(cenas1, cenas2, cenas3);
            return results;
    }
});

pjs.config({
    timeoutInterval: 6000,
    timeoutLimit: 10000,
});

使用 Phantomjs/Pjscrape 抓取多個頁面

問題描述

1 個解決方案

解決方案1
2 已采納 2012-12-11 23:03:31

使用 Phantomjs/Pjscrape 抓取多個頁面

問題描述

1 個解決方案

解決方案1 2 已采納 2012-12-11 23:03:31

解決方案1
2 已采納 2012-12-11 23:03:31