Remove newlines in beautiful soup

Question

In BeautifulSoup, I have the following:

>>> tr = soup.find_all('tr')[1]
<tr>
<td>Adaptive Systems Seminar (HOC+WPO)</td>
<td>wo</td>
<td>13:00</td>
<td>17:00</td>
<td>4:00</td>
<td>22-29, 32-36</td>
<td>MANDERICK BERNARD</td>
<td> </td>
</tr>

However, I'm just interested in the text. So I do

>>> tr(text=True)
[u'\n', u'Adaptive Systems Seminar (HOC+WPO)', u'\n', u'wo', u'\n', u'13:00', u'\n', u'17:00', u'\n', u'4:00', u'\n', u'22-29, 32-36', u'\n', u'MANDERICK BERNARD', u'\n', u'\xa0', u'\n']

I'd like to get the list above, but without all the newlines . I've read the documentation but I can't find anything about it.

Answer 1

One option would be to find all td elements inside and use get_text() :

In [4]: [td.get_text(strip=True) for td in soup.select("tr > td")]
Out[4]: 
[u'Adaptive Systems Seminar (HOC+WPO)',
 u'wo',
 u'13:00',
 u'17:00',
 u'4:00',
 u'22-29, 32-36',
 u'MANDERICK BERNARD',
 u'']

Remove newlines in beautiful soup

Question

1 answers

solution1
1 2015-06-28 19:09:01

Remove newlines in beautiful soup

Question

1 answers

solution1 1 2015-06-28 19:09:01

solution1
1 2015-06-28 19:09:01