字符串格式选项：优点和缺点

Question

这是在Python中格式化字符串的两种非常流行的方法。 一个是使用dict ：

>>> 'I will be %(years)i on %(month)s %(day)i' % {'years': 21, 'month': 'January', 'day': 23}
'I will be 21 on January 23'

而另一个使用简单的tuple ：

>>> 'I will be %i on %s %i' % (21, 'January', 23)
'I will be 21 on January 23'

第一个是更具可读性，但第二个更快写。 我实际上模糊地使用它们。

每个人的利弊是什么？ 关于性能，可读性，代码优化（其中一个转换为另一个？）以及您认为有用的任何其他内容。

Answer 1

为什么`format()`比`%` string操作更灵活

我认为你应该坚持str format()方法，因为它是格式化字符串的首选方式，并可能在将来取代字符串格式化操作。

此外，它还有一些非常好的功能，它们还可以将基于位置的格式与基于关键字的格式相结合 ：

>>> string = 'I will be {} years and {} months on {month} {day}'
>>> some_date = {'month': 'January', 'day': '1st'}
>>> diff = [3, 11] # years, months
>>> string.format(*diff, **some_date)
'I will be 3 years and 11 months on January 1st'

即便是以下工作：

>>> string = 'On {month} {day} it will be {1} months, {0} years'
>>> string.format(*diff, **some_date)
'On January 1st it will be 11 months, 3 years'

还有一个原因支持format() 。 因为它是一个方法，所以它可以作为回调传递，如下例所示：

>>> data = [(1, 2), ('a', 'b'), (5, 'ABC')]
>>> formatter = 'First is "{0[0]}", then comes "{0[1]}"'.format
>>> for item in map(formatter, data):
    print item


First is "1", then comes "2"
First is "a", then comes "b"
First is "5", then comes "ABC"

它不是比字符串格式化操作更灵活吗？

有关% operations和.format()方法之间的比较，请参阅文档页面上的更多示例。

比较基于元组的`%`字符串格式与基于字典的格式

通常有三种方法可以调用%字符串操作 （是，三，而不是两个）：

base_string % values

它们因values的类型而不同（这是base_string的内容的结果）：

它可以是一个tuple ，然后按照它们出现在元组中的顺序逐个替换它们，

 >>> 'Three first values are: %f, %f and %f' % (3.14, 2.71, 1) 'Three first values are: 3.140000, 2.710000 and 1.000000'

它可以是一个dict （字典），然后根据关键字替换它们，

 >>> 'My name is %(name)s, I am %(age)s years old' % {'name':'John','age':98} 'My name is John, I am 98 years old'

它可以是单个值，如果base_string包含应插入值的单个位置：
```
 >>> 'This is a string: %s' % 'abc' 'This is a string: abc' 
```

它们之间存在明显的差异，并且这些方式不能组合（与能够组合某些特征的format()方法相反，如上所述）。

但是有一些东西仅特定于基于字典的字符串格式化操作，并且在剩余的三种格式化操作类型中相当不可用。 这是以简单的方式用实际变量名替换特定程序的能力：

>>> name = 'John'
>>> surname = 'Smith'
>>> age = 87
# some code goes here
>>> 'My name is %(surname)s, %(name)s %(surname)s. I am %(age)i.' % locals()
'My name is Smith, John Smith. I am 87.'

仅供记录：当然，通过使用format()解压缩字典可以很容易地取代上面的内容：

>>> 'My name is {surname}, {name} {surname}. I am {age}.'.format(**locals())
'My name is Smith, John Smith. I am 87.'

有没有其他人知道什么是特定于一种类型的字符串格式化操作的功能，而不是另一种？ 听到它可能会很有趣。

Answer 2

我并没有完全回答你的问题，但只是认为把format放到你的混音中会很好。

我个人更喜欢format的语法：

'I will be {years} on {month} {day}'.format(years=19, month='January', day=23)

如果我想要一些紧凑的东西，我只想写：

'I will be {} on {} {}'.format(19, 'January', 23)

format与对象很好地匹配：

class Birthday:
  def __init__(self, age, month, day):
    self.age = age
    self.month = month
    self.day = day

print 'I will be {b.age} on {b.month} {b.day}'.format(b = Birthday(19, 'January', 23))

Answer 3

我没有回答这个问题，只是解释了我在TIScript中提出的想法。

我已经引入了所谓的“字符串化”函数：名称从'$'开始的任何函数都是字符串化器。 编译器将'$ name（'和'）'视为字符串文字与函数调用相结合的引号。

例如，这个：

$print(I will be {b.age} on {b.month} {b.day});

实际上编译成

$print("I will be ", b.age, " on ",b.month," ",b.day);

其中偶数参数总是文字字符串而奇数字符串是表达式。 这样就可以定义使用不同格式/参数处理的自定义字符串化器。

例如Element.$html(Hello <b>{who}</b>); 将在表达式上应用HTML转义。 而这个Element.$(option[value={12}]); 将以jQuery样式进行选择。

非常方便灵活。

我不确定是否可以在Python中执行类似的操作而不更改其编译器。 考虑一下这个想法。

字符串格式选项：优点和缺点

问题描述

3 个解决方案

解决方案1
21 已采纳 2011-12-06 06:14:21

为什么`format()`比`%` string操作更灵活

比较基于元组的`%`字符串格式与基于字典的格式

解决方案2
20 2011-12-06 05:58:35

解决方案3
-2 2011-12-06 06:31:09

字符串格式选项：优点和缺点

问题描述

3 个解决方案

解决方案1 21 已采纳 2011-12-06 06:14:21

为什么format()比% string操作更灵活

比较基于元组的%字符串格式与基于字典的格式

解决方案2 20 2011-12-06 05:58:35

解决方案3 -2 2011-12-06 06:31:09

解决方案1
21 已采纳 2011-12-06 06:14:21

为什么`format()`比`%` string操作更灵活

比较基于元组的`%`字符串格式与基于字典的格式

解决方案2
20 2011-12-06 05:58:35

解决方案3
-2 2011-12-06 06:31:09