Regular expression to extract from URI

Question

I need a regular expression to extract parts from two types of URIs:

http://example.com/path/to/page/?filter
http://example.com/path/to/?filter

Basically, in both cases I need to somehow isolate and return

/path/to

and

?filter

That is, both /path/to and filter are arbitrary. So I suppose I need two regular expressions for this? I am doing this in PHP, but if someone could help me out with the regular expressions I can figure out the rest. Thanks for your time :)

EDIT: Just to clarify, given for example

http://example.com/help/faq/?sort=latest

I want to get /help/faq and ?sort=latest

Another example

http://example.com/site/users/all/page/?filter=none&status=2

I want to get /site/users/all and ?filter=none&status=2. Note that I do not want to get the page!

Answer 1

Using parse_url might be easier and have fewer side effects than a regex:

$querystring = parse_url($url, PHP_URL_QUERY);
$path = parse_url($url, PHP_URL_PATH);

You could then use explode on the path to split it into segments and take the ones you need (trimming the slashes first avoids empty segments at either end):

$segments = explode("/", trim($path, "/"));
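Putting the two calls together, a minimal sketch might look like the following. It assumes, from the examples in the question, that a trailing literal "page" segment is the part to drop; the helper name splitUri is made up for illustration.

```php
<?php
// Hypothetical helper: returns [path, query] for the URI shapes in the question.
function splitUri(string $url): array
{
    $path  = (string) parse_url($url, PHP_URL_PATH);  // e.g. "/help/faq/"
    $query = parse_url($url, PHP_URL_QUERY);          // e.g. "sort=latest", null if absent

    // Split the path into segments, ignoring leading and trailing slashes.
    $segments = explode('/', trim($path, '/'));

    // Assumption: a trailing literal "page" segment is not wanted.
    if (end($segments) === 'page') {
        array_pop($segments);
    }

    return [
        '/' . implode('/', $segments),
        is_string($query) ? '?' . $query : '',
    ];
}

print_r(splitUri('http://example.com/site/users/all/page/?filter=none&status=2'));
```

If the rule is really "always the first two segments", array_slice($segments, 0, 2) could replace the "page" check.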

Answer 2

Try this:

^http://[^/?#]+/([^/?#]+/[^/?#]+)[^?#]*\?([^#]*)

This will get you the first two URL path segments and query.
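As a rough sketch of how this pattern could be used from PHP with preg_match: the "~" delimiter is chosen here only because the pattern itself contains "/" and "#", and the variable names are illustrative.

```php
<?php
// The pattern from this answer, wrapped in "~" delimiters.
$pattern = '~^http://[^/?#]+/([^/?#]+/[^/?#]+)[^?#]*\?([^#]*)~';
$url     = 'http://example.com/help/faq/?sort=latest';

$path  = null;
$query = null;

if (preg_match($pattern, $url, $m)) {
    $path  = '/' . $m[1];  // group 1 is captured without its leading slash
    $query = '?' . $m[2];  // group 2 is captured without the "?"
}

echo $path, ' ', $query, "\n";
```

Since group 1 always captures exactly two segments, the /site/users/all/page/ example from the question would come back as /site/users rather than /site/users/all.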

Answer 3

Not tested, but:

^https?://[^ /]+[^ ?]+.*

This should match http and https URLs with or without a path: the second character class matches everything up to the ? (from the ?filter, for instance), and the .* matches any remaining characters except \n.
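As written, the pattern only matches and has no capturing groups, so nothing can be pulled out of it. One possible adaptation, with two groups added (the groups are an addition here, not part of the original answer), is sketched below.

```php
<?php
// Same pattern as above, with capturing groups around the path and the remainder.
$pattern = '~^https?://[^ /]+([^ ?]+)(.*)~';
$url     = 'https://example.com/help/faq/?sort=latest';

$path  = null;
$query = null;

if (preg_match($pattern, $url, $m)) {
    $path  = rtrim($m[1], '/');  // group 1 is "/help/faq/"; drop the trailing slash
    $query = $m[2];              // group 2 is everything from the "?" onward
}
```

This returns the whole path, so a trailing "page" segment would still need to be removed separately.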

Answer 4

Have you considered using explode() instead (http://nl2.php.net/manual/en/function.explode.php)? The task seems simple enough for it. You would need two calls (one for the / and one for the ?), but it should be quite straightforward after that.
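The two explode() calls could look roughly like this. It is a sketch that assumes the URL always has the scheme://host/... form shown in the question; the variable names are illustrative.

```php
<?php
$url = 'http://example.com/path/to/?filter';

// First call: split off the query string (limit 2 keeps any later "?" inside the query).
$parts = explode('?', $url, 2);
$query = isset($parts[1]) ? '?' . $parts[1] : '';

// Second call: yields "http:", "", the host, and the rest of the path as one piece.
$pieces = explode('/', $parts[0], 4);
$path   = '/' . rtrim($pieces[3] ?? '', '/');
```

For the URL above this leaves /path/to in $path and ?filter in $query.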

4 answers

solution1
4 ACCEPTED 2010-02-26 23:16:05

solution2
0 2010-02-26 23:12:16

solution3
0 2010-02-26 23:12:23

solution4
0 2010-02-26 23:13:21
