Extract content of <Script> in Python with BeautifulSoup

Question

I want to extract the value of window.FEED__INITIAL__STATE from a <script> tag.

How can I do it?

Answer 1

You could try something like this:

import requests
from bs4 import BeautifulSoup


def check_script_tag(url):
    r = requests.get(url)
    parsed_html = BeautifulSoup(r.content, features="html.parser")

    try:
        # find() returns only the first <script> inside <body>, or None.
        text = parsed_html.body.find('script').text
        print(text)  # Text of the script tag
    except AttributeError:
        # Raised when the page has no <body> or no <script> inside it.
        print("There is no script tag !!")


check_script_tag("https://stackoverflow.com")
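
Note that find('script') only returns the first script tag, and the variable may well sit in a later one or in <head>. As a rough sketch (the sample HTML and the helper name find_state_script are made up for illustration, not taken from the real page), you can let BeautifulSoup filter on the script's text with the string argument:

```python
import re

from bs4 import BeautifulSoup

# Hypothetical page: the variable lives in the second script tag, inside <head>.
SAMPLE_HTML = """
<html>
  <head>
    <script>var unrelated = 1;</script>
    <script>window.FEED__INITIAL__STATE = {"items": [1, 2, 3]};</script>
  </head>
  <body><p>Hello</p></body>
</html>
"""


def find_state_script(html, name="window.FEED__INITIAL__STATE"):
    """Return the text of the first script tag that mentions `name`, or None."""
    soup = BeautifulSoup(html, "html.parser")
    # Search the whole document, not just <body>, and match on the tag's text.
    tag = soup.find("script", string=re.compile(re.escape(name)))
    return tag.string if tag is not None else None


script_text = find_state_script(SAMPLE_HTML)
print(script_text)
```

For this sample, the body-only lookup above would report that there is no script tag, because both scripts are in <head>.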

Answer 2

First, find all the script tags, then check each one for the variable name.

P.S. This is an updated version of RasitAydin's code.

import requests
from bs4 import BeautifulSoup


def check_script_tag(url):
    r = requests.get(url)
    parsed_html = BeautifulSoup(r.content, features="html.parser")

    # Search the whole document so that scripts in <head> are included
    # and a page without <body> does not raise AttributeError.
    script_tags = parsed_html.find_all('script')
    for script_tag in script_tags:
        text = script_tag.text
        # Case-insensitive check for the variable name.
        if 'window.FEED__INITIAL__STATE'.lower() in text.lower():
            print(text)


check_script_tag("YOUR WEB URL")
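
The code above prints the whole script, not the value itself. If the value assigned to the variable is plain JSON (an assumption; I have not checked the actual page), something along these lines might pull it out of the script text. The helper name extract_state and the sample string are hypothetical:

```python
import json
import re


def extract_state(script_text, name="window.FEED__INITIAL__STATE"):
    """Parse the JSON value assigned to `name` in a script's text.

    Assumes the value is a JSON literal. Returns None if `name` is not
    assigned in the text; raises json.JSONDecodeError if the value is
    not valid JSON.
    """
    match = re.search(re.escape(name) + r"\s*=\s*", script_text)
    if match is None:
        return None
    # raw_decode parses one JSON value starting at the given index and
    # ignores whatever follows it (a trailing semicolon, more JavaScript).
    value, _end = json.JSONDecoder().raw_decode(script_text, match.end())
    return value


# Made-up script content, only to show the call.
sample = 'window.FEED__INITIAL__STATE = {"feed": {"count": 2, "ids": ["a", "b"]}};'
state = extract_state(sample)
print(state["feed"]["count"])  # prints 2
```

Using raw_decode avoids having to guess where the object ends with a regex, which tends to break on nested braces.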

2 answers

solution1
1 2018-09-15 07:40:21

solution2
0 2018-09-15 11:09:09
