How to pull some information from a string with Python?

Question

I'm just starting to play around with BeautifulSoup and I'm trying to build something in Python, but when I scrape for the information, the tags are included in the results, which I do not want. Is there any way I can separate the product ID from the tags?

Example of my results:

<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>
<product-id type="integer">8422899464</product-id>

Answer 1

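Try something like this if you want the text of the first product-id tag (find() returns only the first match):

# get_text() returns the tag's contents without the surrounding markup
data = soup.find('product-id').get_text()
print(data)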
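
One thing to watch for with this approach: find() returns None when no tag matches, so calling get_text() on the result raises an AttributeError. Below is a minimal sketch of a guarded version. The sample markup, the `variant` wrapper tag, and the helper name `first_product_id` are assumptions made up for illustration, not part of the question.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the scraped response.
xml = '<variant><product-id type="integer">8422899464</product-id></variant>'
soup = BeautifulSoup(xml, "html.parser")


def first_product_id(soup):
    """Return the text of the first <product-id> tag, or None if there is none."""
    tag = soup.find("product-id")
    # find() returns None when nothing matches, so check before reading the text.
    return tag.get_text(strip=True) if tag is not None else None


print(first_product_id(soup))
print(first_product_id(BeautifulSoup("<variant></variant>", "html.parser")))
```

The built-in "html.parser" is used here only so the sketch has no extra dependencies; if you already parse with another parser, the find() call should work the same way.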

Answer 2

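Calling soup('product-id') is shorthand for soup.find_all('product-id'), so this collects the text of every matching tag:

[tag.text for tag in soup('product-id')]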

out:

['8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464',
 '8422899464']
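
Since the tags are marked type="integer" and the same ID repeats many times in the output, it may be useful to convert the values to int and drop duplicates. Here is a rough, self-contained sketch of that idea. The sample markup, the `variants`/`variant` wrapper tags, and the second ID value are invented for illustration; only the product-id tag itself comes from the question.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; the real input would be the scraped response.
xml = """
<variants>
  <variant><product-id type="integer">8422899464</product-id></variant>
  <variant><product-id type="integer">8422899464</product-id></variant>
  <variant><product-id type="integer">8422899465</product-id></variant>
</variants>
"""
soup = BeautifulSoup(xml, "html.parser")

# Text of every <product-id> tag, converted to int.
product_ids = [int(tag.get_text(strip=True)) for tag in soup.find_all("product-id")]

# dict.fromkeys() keeps the first occurrence of each value, preserving order.
unique_ids = list(dict.fromkeys(product_ids))

print(product_ids)
print(unique_ids)
```

If the order of the IDs does not matter, set(product_ids) would also remove the duplicates.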

2 answers

solution1
3 2016-12-26 12:46:14

solution2
2 2016-12-26 12:53:02
