Python & Scrapy output: "\r\n\t\t\t\t\t\t\t"

Question

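I am learning to scrape with Scrapy and have run into a problem with some code: it gives me strange output that I don't understand. Can someone explain why I get a bunch of "\r\n\t\t\t\t\t\t\t" in the results?

I found this solution on Stack Overflow: Remove an '\\n\\t\\t\\t'-element from list

But I would like to understand what causes it.

Here is the code that produces the problem. The strip method from the link above fixes it, but as I said, I don't understand where these characters come from.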

import scrapy
import logging
import re

class CitySpider(scrapy.Spider):
    name = 'city'
    allowed_domains = ['www.a-tembo.nl']
    start_urls = ['https://www.a-tembo.nl/themas/category/city/']

    def parse(self, response):
        titles = response.xpath("//div[@class='hikashop_category_image']/a")
        
        for title in titles:
            series = title.xpath(".//@title").get()
            link = title.xpath(".//@href").get()

            #absolute_url = f"https://www.a-tembo.nl{link}"
            #absolute_url = response.urljoin(link)

            yield response.follow(link, callback=self.parse_title)

    def parse_title(self, response):
        rows = response.xpath("//table[@class='hikashop_products_table adminlist table']/tbody/tr")

        for row in rows:
            product_code = row.xpath(".//span[@class='hikashop_product_code']/text()").get()
            product_name = row.xpath(".//span[@class='hikashop_product_name']/a/text()").get()

            yield {
                "Product_code": product_code,
                "Product_name": product_name,
            }

Answer 1

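Sequences like \n are called escape characters. For example, \n stands for a newline, \r for a carriage return, and \t for a tab. Web pages are full of them, although you never see them unless you inspect the page source. Most likely the HTML of the page you are scraping has a line break and indentation inside the elements you select, and text() returns that whitespace together with the visible text. If you want to learn more about escape characters in Python, you can read about them here. I hope this answers your question.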
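
As a rough illustration of the point above, here is a minimal sketch of what such a text node looks like in Python and how stripping it behaves. The sample string and the helper name clean_text are assumptions made up for this example; the real markup on the site may differ. The helper also guards against None, since .get() returns None when the XPath matches nothing.

```python
# Hypothetical text node: a line break plus tab indentation around the visible
# text, similar to what text() might return. The real page content may differ.
raw_text = "\r\n\t\t\t\t\t\t\tExample product\r\n\t\t\t\t\t\t"


def clean_text(value):
    """Strip leading/trailing whitespace; pass None through unchanged."""
    if value is None:
        # .get() returns None when nothing matched, so avoid calling .strip() on it.
        return None
    # str.strip() with no arguments removes spaces, \r, \n and \t at both ends.
    return value.strip()


print(repr(raw_text))              # shows the escape sequences explicitly
print(repr(clean_text(raw_text)))  # whitespace at both ends is gone
print(repr(clean_text(None)))      # None stays None
```

In the spider from the question, this could be applied as clean_text(row.xpath(...).get()) before yielding the item. Wrapping the XPath expression in normalize-space() may be another option, with the difference that it also collapses runs of whitespace inside the text.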

Solution 1
1 2020-10-12 13:05:16
