简体   繁体   中英

Multi Curl, error handling

I'm working with multi Curl and was wondering how to handle the errors. I want to check which error occured and if it is an error like, rate limit exceeded I want to crawl that link again after some delay (sleep()). My question: "Is there a build in function which can do this for me or do I need to collect all Urls in an array and just run those again?"

This is what I've got now:

<?php

$urls = array(  "https://API-URL.com",
                "https://API-URL.com",
                "https://API-URL.com",
                "https://API-URL.com",
                ...);

//create the multiple cURL handle
$mh = curl_multi_init();

//Number of elements in $urls
$nbr = count($urls);

// set URL and options
for($x = 0; $x < $nbr; $x++){

    // create both cURL resources
    $ch[$x] = curl_init();

    // set URL and other appropriate options
    curl_setopt($ch[$x], CURLOPT_URL, $urls[$x]);
    curl_setopt($ch[$x], CURLOPT_RETURNTRANSFER, true );
    curl_setopt($ch[$x], CURLOPT_SSL_VERIFYPEER, false);

    //add the two handles
    curl_multi_add_handle($mh,$ch[$x]);
}

//execute the handles
do {
    curl_multi_exec($mh, $running);
} while ($running);

for($x = 0; $x < $nbr; $x++){
    $result = curl_multi_getcontent($ch[$x]);

    $decoded = json_decode($result, true);

    //get info about the request
    $error = curl_getinfo($ch[$x], CURLINFO_HTTP_CODE);

    //error handling
    if($error != 200){

        $again[] = array("Url" => $urls[$x], "errornbr" => $error); 

    } else {

        // Here I do what ever I want with the data
    }

    curl_multi_remove_handle($mh, $ch[$x]);
    curl_close($ch[1]);
}

curl_multi_close($mh);
?>

In the second for-loop, when you are cycling through the curl handlers to examine what did each curl handler return, I hope, this approach will answer you question

foreach ($ch as $key => $h) {

//This code is actually checking for any error that may occur, whatever that 
//error is you can handle it in the if-part of the condition. and save those 
//urls to the array $again to call them on a later stage.

if (curl_errno($h)) {

//this is how you will get complete information what did happened to the 
//curl handler. And why did it fail. All the inforation will be stored in //error_info.

 $again[] = array("Url" =>curl_getinfo($h, CURLINFO_EFFECTIVE_URL), "error_info" => curl_getinfo($h)); 

}
else{

//here you will handle the success scenario for each curl handler.

$responses[$key] = ['data' => curl_multi_getcontent($h)];
}

//remove curl handler as you are doing in the loop

}

For multiple handles there ishttps://www.php.net/manual/en/function.curl-multi-info-read.php

so error check (assuming http connection) should look like:

while ($a = curl_multi_info_read($mh))
{
  if ($b = $a['result'])
  {
    echo curl_strerror($b);# CURLE_* error
  }
  elseif (!($b = curl_getinfo($a['handle'], CURLINFO_RESPONSE_CODE)))
  {
    echo 'connection failed';
  }
  elseif ($b !== 200)
  {
    echo 'HTTP status is not 200 OK';
  }
}

Consider this code as pseudo-code for modern PHPs (i didn't test this exact variant, but scheme will work). Calling curl_errno() on "easy" handles added to "multi" handle will return 0 which is not an error.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM