How to get a certain table tag from div tag inside of html?

Question

I'm trying to get the table information from the web-site http://www.o1vsk.lv/index.php/stundu-izmainas . html content of the web-page i need to extract

from bs4 import BeautifulSoup
from urllib.request import urlopen
html = urlopen("http://www.o1vsk.lv/index.php/stundu-izmainas").read()

rows=[]
soup=BeautifulSoup(html,"html.parser")
box = soup.find('div', {'class': 'DRight'})

This program gets all content of the page, while I need only one little table in the text format like:

the table i need to get in the text format 6.d

...
...
...
...
...
...
...
...

Answer 1

sorry I cannot comment yet due to my reputation is < 50

Here is my solution for you.

Find all the table tag and it will return HTML code

table = box.findAll("table")

Convert the HTML to Pandas DataFrame (df). Why index = 1? Because the table you want in the index 1

df = pd.read_html(str(table))[1]

Lastly, remove Unnamed column to get only the needed column

df.loc[:, ~df.columns.str.match('Unnamed')]

Here is the full code:

from pandas import pd
from bs4 import BeautifulSoup
from urllib.request import urlopen
html = urlopen("http://www.o1vsk.lv/index.php/stundu-izmainas").read()

rows=[]
soup=BeautifulSoup(html,"html.parser")
box = soup.find('div', {'class': 'DRight'})

table = box.findAll("table")
df = pd.read_html(str(table))[1]

df.loc[:, ~df.columns.str.match('Unnamed')]

please upvote if this help you:) thanks

How to get a certain table tag from div tag inside of html?

Question

1 answers

solution1
1 ACCPTED 2022-08-10 15:45:35

How to get a certain table tag from div tag inside of html?

Question

1 answers

solution1 1 ACCPTED 2022-08-10 15:45:35

solution1
1 ACCPTED 2022-08-10 15:45:35