
Extract complete URL from a link

Original post 2022-12-19 18:15:06 5 1 scrapy / playwright-python

I am scraping amazon.in using scrapy-playwright. I am able to extract the description, rating, and price of the desired item. However, to go to the next page, I want to extract the href of the Next Page button at the bottom of the page.

Through my scrapy-playwright Python code, I am able to extract the href of the Next button as: href="/s?k=Soap+for+men&page=2"

When I copy the URL from the browser, it looks like: https://www.amazon.in/s?k=soap+for+men&page=2&crid=1A43B14UY65X0&qid=1671472636&sprefix=soap+for+men%2Caps%2C262&ref=sr_pg_1

How do I generate the complete URL, including crid, from the link extracted through code?

1 answer

crid, qid and sprefix are query parameters to specify additional information about the request being made to the server.

crid: This stands for "customer request ID". It is a unique identifier that is generated by Amazon to track customer requests.

qid: This stands for "query ID". It is a unique identifier that is generated by Amazon to track search queries.

sprefix: This stands for "search prefix". It specifies the prefix for the search query, which can be used to refine the search results.
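To see which query parameters the browser URL actually carries, it can be split with Python's standard library. This is only a small sketch using the URL from the question; it shows the parameter names and values, not what Amazon does with them.

```python
from urllib.parse import urlsplit, parse_qs

# URL copied from the browser, as quoted in the question
browser_url = (
    "https://www.amazon.in/s?k=soap+for+men&page=2&crid=1A43B14UY65X0"
    "&qid=1671472636&sprefix=soap+for+men%2Caps%2C262&ref=sr_pg_1"
)

# parse_qs decodes the query string into a dict of name -> list of values
params = parse_qs(urlsplit(browser_url).query)

for name, values in params.items():
    print(name, values)
```

The href extracted by the spider only contains `k` and `page`; the remaining names (`crid`, `qid`, `sprefix`, `ref`) are the extra ones added in the browser.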

These query parameters are used by Amazon to track and optimize the performance of its search function. They do not necessarily have any meaning for the user or for the content of the page being requested. You can run your spider without these query parameters and it won't make any difference to the output.
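Since the extra parameters are not needed, the remaining step is turning the relative href into an absolute URL. A minimal sketch with the standard library follows; `base_url` is an assumption standing in for the URL of the page the spider is currently on.

```python
from urllib.parse import urljoin

# Assumed URL of the current results page (hypothetical; use the real page URL)
base_url = "https://www.amazon.in/s?k=soap+for+men"

# Relative href extracted from the Next button, as quoted in the question
next_href = "/s?k=Soap+for+men&page=2"

# urljoin resolves the relative href against the scheme and host of base_url
next_url = urljoin(base_url, next_href)
print(next_url)
```

Inside a Scrapy callback, `response.urljoin(next_href)` performs the same resolution against the response URL, and `response.follow(next_href, callback=self.parse)` accepts the relative href directly.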


The technical posts on this site follow the CC BY-SA 4.0 license. If you reprint, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1 0 2022-12-21 11:46:34
