How to deal with `;` with `urllib.parse.parse_qsl()`?

Question

; can not be dealt by parse_qsl() . Is there a way to make it aware of ; ? Thanks.

>>> import urllib.parse
>>> urllib.parse.parse_qsl('http://example.com/?q=abc&p=1;2;3')
[('http://example.com/?q', 'abc'), ('p', '1')]

Answer 1

It would be best to make sure that the URLs you are dealing with have the semicolons URL encoded. eg http://example.com/?q=abc&p=1%3B2%3B3

If for some reason you can't do the above, you could do something like this:

from urllib.parse import urlparse, unquote_plus

url = "http://example.com/?q=abc&p=1;2;3"
parts = urlparse(url)
qs = parts.query
pairs = [p.split("=", 1) for p in qs.split("&")]
decoded = [(unquote_plus(k), unquote_plus(v)) for (k, v) in pairs]

>>> decoded
[('q', 'abc'), ('p', '1;2;3')]

The above code assumes a few things about the query string. eg that all keys have values. If you want something that makes fewer assumptions, see the parse_qsl source code .

Answer 2

Actually, it does treat them correctly (as delimiters). You just have to tell it to keep blank values:

>>> urllib.parse.parse_qsl('q=abc&p=1;2;3', keep_blank_values=True)
[('q', 'abc'), ('p', '1'), ('2', ''), ('3', '')]

Note that you should not be passing the entire url to parse_qsl , only the querystring part.

How to deal with `;` with `urllib.parse.parse_qsl()`?

Question

2 answers

solution1
2 ACCPTED 2019-11-11 08:34:24

solution2
0 2019-11-11 08:41:46

How to deal with `;` with `urllib.parse.parse_qsl()`?

Question

2 answers

solution1 2 ACCPTED 2019-11-11 08:34:24

solution2 0 2019-11-11 08:41:46

solution1
2 ACCPTED 2019-11-11 08:34:24

solution2
0 2019-11-11 08:41:46