Pythonic way of parsing this string?

Question

I'm parsing this line -

0386          ; Greek # L&       GREEK CAPITAL LETTER ALPHA WITH TONOS

Basically, I need -

point = 0386
script = Greek

And I'm doing it like this,

point = line.split(";")[0].replace(" ","")
script = line.split("#")[0].split(";")[1].replace(" ","")

I'm not convinced that what I'm doing is the most pythonic way of doing it, is there a more elegant way of doing this? Maybe a regex one-liner?

Answer 1

If you want a regex one liner:

point, script = re.search("^(\d+)\s*;\s*(\S+)\s*.*$",s).groups()

where s is your string, and of course you need to import re

Answer 2

>>> code, desc = line[:line.rfind('#')].split(';')
>>> code.strip()
'0386'
>>> desc.strip()
'Greek'

Answer 3

Using map with unbound method str.strip :

>>> line = '0386      ; Greek # L&   GREEK CAPITAL LETTER ALPHA WITH TONOS'
>>> point, script = map(str.strip, line.split('#')[0].split(';'))
>>> point
'0386'
>>> script
'Greek'

Using list comprehension:

>>> point, script = [word.strip() for word in line.split('#')[0].split(';')]
>>> point
'0386'
>>> script
'Greek'

Answer 4

This is how I would've done it:

>>> s = "0386          ; Greek # L&       GREEK CAPITAL LETTER ALPHA WITH TONOS"
>>> point = s.split(';')[0].strip()
>>> point
'0386'
>>> script = s.split(';')[1].split('#')[0].strip()
>>> script
'Greek'

Note that you can re-use s.split(';') . So perhaps saving it to a var would be a good idea:

>>> var = s.split(';')
>>> point = var[0].strip()  # Strip gets rid of all the whitespace
>>> point
'0386'
>>> script = var[1].split('#')[0].strip()
>>> script
'Greek'

Pythonic way of parsing this string?

Question

4 answers

solution1
3 2014-01-06 09:25:56

solution2
3 2014-01-06 09:28:31

solution3
2 ACCPTED 2014-01-06 09:21:24

solution4
0 2014-01-06 09:20:38

Pythonic way of parsing this string?

Question

4 answers

solution1 3 2014-01-06 09:25:56

solution2 3 2014-01-06 09:28:31

solution3 2 ACCPTED 2014-01-06 09:21:24

solution4 0 2014-01-06 09:20:38

solution1
3 2014-01-06 09:25:56

solution2
3 2014-01-06 09:28:31

solution3
2 ACCPTED 2014-01-06 09:21:24

solution4
0 2014-01-06 09:20:38