Grab a website with PHP and then traverse it with jQuery

Question

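I'm building a system where I need to fetch the contents of a web page with PHP and then parse it to extract certain tables and so on. Is there an easy way to do this with jQuery, or is it best to write PHP functions that extract the data?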

Answer 1

jQuery has nothing to do with PHP and can't be run without a browser, so you're out of luck there.

However, there is phpQuery, which allows DOM parsing with jQuery-style selectors!

Answer 2

Do it like this in PHP, with the native DOM functions and XPath:

    $dom = new DOMDocument();
    // @ suppresses the warnings that malformed real-world HTML triggers
    @$dom->loadHTML($html);
    $x = new DOMXPath($dom);
    // grab all tables with an id of foo
    foreach ($x->query("//table[@id='foo']") as $node)
    {
        // here is the markup, serialized in canonical form
        echo $node->C14N();
        // grab the contained text
        echo $node->textContent;
    }
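
Since the question is about pulling tables out of a page, here is a minimal sketch that builds on the snippet above and turns a table into nested arrays. The sample markup and the helper name `tableRows()` are assumptions made up for illustration; in practice `$html` would be the page you fetched.

```php
<?php
// Hypothetical sample markup standing in for a fetched page.
$html = '<table id="foo"><tr><th>Name</th><th>Qty</th></tr>'
      . '<tr><td>Apple</td><td>3</td></tr>'
      . '<tr><td>Pear</td><td>5</td></tr></table>';

/**
 * Return the rows of the table with the given id as arrays of trimmed cell text.
 * tableRows() is an illustrative helper, not a built-in function.
 */
function tableRows(string $html, string $id): array
{
    // Collect parser warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $rows = [];
    // $id is interpolated into the XPath string, so it must be a trusted value.
    foreach ($xpath->query("//table[@id='$id']//tr") as $tr) {
        $cells = [];
        // Header and data cells that are direct children of this row.
        foreach ($xpath->query('./th | ./td', $tr) as $cell) {
            $cells[] = trim($cell->textContent);
        }
        $rows[] = $cells;
    }
    return $rows;
}

print_r(tableRows($html, 'foo'));
```

Using `libxml_use_internal_errors()` rather than `@` is a design choice: it silences only the parser's complaints about messy HTML and leaves other errors visible.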

Answer 3

You can use the DOM functions available in PHP: http://php.net/manual/en/book.dom.php

Answer 4

You can't. jQuery is for JavaScript, which is client-side, and requires a JavaScript engine to execute.

I would suggest you read the HTML as XML, but you'll run into all sorts of trouble if the HTML is not valid XHTML.
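
To illustrate the kind of trouble meant here, the sketch below feeds the same fragment to a strict XML parse and to the lenient HTML parser in PHP's DOM extension. The fragment is a made-up example with an unquoted attribute and an unclosed `<br>`, both common in real pages.

```php
<?php
// Hypothetical fragment: not well-formed XML, but ordinary HTML.
$html = '<div><p class=intro>Hello<br>world</p></div>';

// Keep parser errors out of the output.
libxml_use_internal_errors(true);

// Strict XML parsing rejects the fragment: loadXML() returns false.
$xmlDoc = new DOMDocument();
$loadedAsXml = $xmlDoc->loadXML($html);
var_dump($loadedAsXml);

// The HTML parser recovers from the same input.
$htmlDoc = new DOMDocument();
$htmlDoc->loadHTML($html);
libxml_clear_errors();

$p = $htmlDoc->getElementsByTagName('p')->item(0);
echo $p->textContent, "\n";
```

So for arbitrary pages, `loadHTML()` is usually the safer entry point, and XML parsing is only an option when you know the source is well-formed.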

Answer 5

Simple HTML DOM is awesome for this:

http://sourceforge.net/projects/simplehtmldom/

Example:

    // Load the library (assumes simple_html_dom.php is on the include path)
    require_once 'simple_html_dom.php';

    // Create DOM from URL or file
    $html = file_get_html('http://www.google.com/');

    // Find all images
    foreach ($html->find('img') as $element)
        echo $element->src . '<br>';

    // Find all links
    foreach ($html->find('a') as $element)
        echo $element->href . '<br>';

Answer 6

There are a few PHP tools that can help you with this: cURL, DOM, and XPath.

Here's a good tutorial I've used before.
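
As a rough sketch of how those three pieces fit together: cURL fetches the page, DOM parses it, and XPath queries it. The helper names `fetchPage()` and `linkTargets()`, the timeout value, and the sample markup are all assumptions for illustration. The script itself only parses an inline string, so it runs without network access.

```php
<?php
/**
 * Fetch a page body with cURL. fetchPage() is an illustrative helper name.
 * Throws RuntimeException if the transfer fails.
 */
function fetchPage(string $url): string
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true, // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true, // follow redirects
        CURLOPT_TIMEOUT        => 15,   // seconds; an arbitrary choice
    ]);
    $body = curl_exec($ch);
    if ($body === false) {
        throw new RuntimeException('Fetch failed: ' . curl_error($ch));
    }
    return $body;
}

/**
 * Return the href value of every link in the markup, in document order.
 * linkTargets() is an illustrative helper name.
 */
function linkTargets(string $html): array
{
    $previous = libxml_use_internal_errors(true); // silence warnings about messy HTML
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $targets = [];
    // //a/@href selects the attribute nodes, skipping anchors without an href.
    foreach ($xpath->query('//a/@href') as $attr) {
        $targets[] = $attr->value;
    }
    return $targets;
}

// Hypothetical sample markup standing in for a fetched page.
$sample = '<p><a href="/a">A</a> <a href="/b">B</a> <a>no href</a></p>';
print_r(linkTargets($sample));
```

Against a live site you would pass the fetched body instead, for example `linkTargets(fetchPage('http://www.example.com/'))`, which requires the cURL extension to be enabled.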

6 answers

solution1
7 ACCEPTED 2010-11-03 20:00:28

solution2
3 2010-11-03 20:07:18

solution3
1 2010-11-03 20:03:37

solution4
1 2010-11-03 20:05:25

solution5
0 2010-11-03 20:05:38

solution6
0 2010-11-03 20:08:55
