Storing everything after a certain word in a line, in a list- Regex

Question

So I have a line,

unicomp6.unicomp.net - - [01/Jul/1995:00:00:14 -0400] "GET /images/NASA-logosmall.gif HTTP/1.0" 200 786

And I want to store everything after HTTP/1.0" (so those two numbers) into list, how would I do this using regex? I have read the docs on them but they confuse me a bit.

Answer 1

You can use regex101 , to construct regular expressions which suit your need.

For your particular example, the following RE would work:

HTTP\/1.0.(.*$)

Explanation:

Capture in group everthing after HTTP 1.0"

Gives output:

` 200 786`

Answer 2

import re
text = 'unicomp6.unicomp.net - - [01/Jul/1995:00:00:14 -0400] "GET /images/NASA-logosmall.gif HTTP/1.0" 200 786'
regex = r'HTTP/1.0".*$'
match = re.search(regex, text)
list_with_numbers = match.groups()[0].split()

Answer 3

You don't need regex for this, you can use built-in str methods. Eg,

s = 'unicomp6.unicomp.net - - [01/Jul/1995:00:00:14 -0400] "GET /images/NASA-logosmall.gif HTTP/1.0" 200 786'
data = s.partition('HTTP/1.0" ')
nums = data[2].split()
print(nums)

output

['200', '786']

You could also use .split() instead of .partition() , but I think .partition() is more natural here. Note that the numbers stored in nums are strings, so you'll need to add a conversion step if you need to do arithmetic with them.

Here's an example using .split() instead of .partition() that converts the number strings to integers.

data = s.split('HTTP/1.0"')
nums = [int(u) for u in data[1].split()]
print(nums)

output

[200, 786]

Answer 4

Do you have to use a regular expression? If not, you could do this:

>>> lines = ['unicomp6.unicomp.net - - [01/Jul/1995:00:00:14 -0400] "GET /images/NASA-logosmall.gif HTTP/1.0" 200 786']
>>> 
>>> numbers = [line.split()[-2:] for line in lines]
>>> numbers
[['200', '786']]
>>>

This assumes that "the last two whitespace-delimited strings" is equivalent to what you want.

Storing everything after a certain word in a line, in a list- Regex

Question

4 answers

solution1
2 ACCPTED 2015-08-12 07:59:35

solution2
2 2015-08-12 08:00:17

solution3
1 2015-08-12 07:58:25

solution4
0 2015-08-12 08:01:16

Storing everything after a certain word in a line, in a list- Regex

Question

4 answers

solution1 2 ACCPTED 2015-08-12 07:59:35

solution2 2 2015-08-12 08:00:17

solution3 1 2015-08-12 07:58:25

solution4 0 2015-08-12 08:01:16

solution1
2 ACCPTED 2015-08-12 07:59:35

solution2
2 2015-08-12 08:00:17

solution3
1 2015-08-12 07:58:25

solution4
0 2015-08-12 08:01:16