简体繁体中英

Split string with commas also splits ampersands

原文 2012-04-19 04:38:43 0 2 python/ html-parser

The code below parses an HTML, the trouble is splitting when ampersands appear in the data.

from HTMLParser import HTMLParser

data = '<HTML><meta http-equiv="Pragma" content="no-cache"></head>'\
'<body>107,1,236,1000,70,498,NameA NameB & NameC - ActionA ActionB</body></html>'

class MyHTMLParser(HTMLParser):
      def handle_data(self, data):
            print data.split(',')

parser = MyHTMLParser()
parser.feed(data)

Output
It is splitting the '&' instead of only commas.

['107', '1', '236', '1000', '70', '498', 'NameA NameB ']
['&']
[' NameC - ActionA ActionB']

Thanks

2 answers

好吧，我认为这是要走的路，

data2 = data.replace('&', 'and')

另一种解决方案是，在<body>标记中获取值，然后使用Beautifulsoup或您选择的任何库使用data.split(',')进行解析。

Split names separated with commas when surnames are also separated with commas

Split a string column and put the splits in different columns

Split DataFrame string column into N splits

Split string based on number of commas

Split string with commas to new line repeating the string

how to split string into array on commas but ignore commas in parentheses

Split string on commas but ignore commas within double-quotes?

python - split iterable according string and make splits of same length

How to split a string on commas or periods in nltk

python re split string by commas and space

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Split names separated with commas when surnames are also separated with commas Split a string column and put the splits in different columns Split DataFrame string column into N splits Split string based on number of commas Split string with commas to new line repeating the string how to split string into array on commas but ignore commas in parentheses Split string on commas but ignore commas within double-quotes? python - split iterable according string and make splits of same length How to split a string on commas or periods in nltk python re split string by commas and space

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM