我可以在 python 中为 dict 动态创建密钥吗？

Question

I am creating a spider in Scrapy.我正在 Scrapy 中创建一个蜘蛛。 And I want to scrape table in this way:我想以这种方式刮桌子：

Take every <tr>获取每个<tr>
Use <th> as key and <td> as content使用<th>作为键，使用<td>作为内容

The code I came up with is this.我想出的代码是这样的。

def parse(self, response):
        item = {}
        item['code'] = response.xpath('//meta[@itemprop="sku"]/@content').extract_first()
        tables = response.css('.technical-specs')
        for table in tables:
            specs = tables.xpath('tbody/tr')
            for s in specs:
                key = s.xpath('th/text()').extract_first().replace(" ", "_").replace("(", "_").replace(")", "_").replace("/", "").lower()
                value = s.xpath('td/text()').extract_first()
                item[key] = value


        return item

But it is not working.但它不起作用。 Is this posible to achieve?这有可能实现吗？

Answer 1

You need to create a dict instance and then add the items inside the loop.您需要创建一个 dict 实例，然后在循环内添加项目。 Eg:例如：


my_dict = dict() # Can be {} to

for item in items:
  key = item.key
  value = item.value
  my_dict[key] = value

Regards

Answer 2

The, now working, code of parse function is updated in my question details.我的问题详细信息中更新了解析 function 的现在工作代码。 Problem was not in the way loop or dictionary was implemented, but in how I extracted data.问题不在于循环或字典的实现方式，而在于我如何提取数据。 I was using .extract() which makes response unicode and "unscrapable".我正在使用.extract()这使得响应 unicode 和“不可回收”。 Removing.extract was the fix. Remove.extract 是解决方法。

我可以在 python 中为 dict 动态创建密钥吗？

问题描述

2 个解决方案

解决方案1
0 2020-04-27 14:19:50

解决方案2
0 2020-04-27 18:55:04

我可以在 python 中为 dict 动态创建密钥吗？

问题描述

2 个解决方案

解决方案1 0 2020-04-27 14:19:50

解决方案2 0 2020-04-27 18:55:04

解决方案1
0 2020-04-27 14:19:50

解决方案2
0 2020-04-27 18:55:04