从Perl中的字符串中删除字符和数字

Question

我正在尝试重命名目录中的一堆文件，并且卡在了它的正则表达式部分。

我想从文件名中删除出现在开头的某些字符。

_00-author--book_revision_ 1： _00-author--book_revision_

预期： Author - Book (Revision)

到目前为止，我已经可以使用正则表达式删除下划线并首字母大写

$newfile =~ s/_/ /g;
$newfile =~ s/^[0-9]//g;
$newfile =~ s/^[0-9]//g;
$newfile =~ s/^-//g;
$newfile = ucfirst($newfile);

这不是一个好方法。 在删除所有字符之前，我需要帮助，直到您击中第一个字母，并且当您击中第一个“-”时，我想在“-”之前和之后添加一个空格。 另外，当我按下第二个'-'时，我想将其替换为'（'。

非常感谢您采取正确方法的任何指导，技巧甚至建议。

Answer 1

您的说明和示例不匹配。

根据您的指示，

s/^[^\pL]+//;    # Remove everything until first letter.
s/-/ - /;        # Replace first "-" with " - "
s/-[^-]*\K-/(/;  # Replace second "-" with "("

根据您的示例，

s/^[^\pL]+//;
s/--/ - /;
s/_/ (/;
s/_/)/;
s/(?<!\pL)(\pL)/\U$1/g;

Answer 2

$filename =~ s,^_\d+-(.*?)--(.*?)_(.*?)_$,\u\1 - \u\2 (\u\3),;

我的Perl解释器（使用严格和警告）说，最好这样写：

$filename =~ s,^_\d+-(.*?)--(.*?)_(.*?)_$,\u$1 - \u$2 (\u$3),;

第一个可能更喜欢它的味道！ （当然，两个版本的工作原理相同。）

说明（按stema的要求）：

$filename =~ s/
  ^       # matches the start of the line
  _\d+-   # matches an underscore, one or more digits and a hypen minus
  (.*?)-- # matches (non-greedyly) anything before two consecutive hypen-minus
          #   and captures the entire match (as the first capture group)
  (.*?)_  # matches (non-greedyly) anything before a single underscore and
          #  captures the entire match (as the second capture group)
  (.*?)_  # does the same as the one before (but captures the match as the
          #  third capture group obviously)
  $       # matches the end of the line
/\u$1 - \u$2 (\u$3)/x;

替换规范中的\\u${1..3}仅告诉Perl将捕获组从1到3插入，它们的第一个字符大写。 如果要使整个匹配（在捕获的组中）大写，则必须改用\\U

x标志打开了详细模式，该模式告诉Perl解释器我们要使用＃注释，因此它将忽略这些注释（以及正则表达式中的任何空格-因此，如果要匹配空格，则必须使用\\s或\\ ）。 不幸的是，我无法弄清楚如何让Perl忽略*替换*规范中的空白-这就是为什么我在一行上编写了空白。

（另请注意，我已经改变了我的s终止从,到/ - Perl的咆哮在我，如果我用了,用详细模式开启...不知道是什么原因。）

Answer 3

那么，您是要大写新文件名的所有组成部分还是仅将第一个大写？ 您的问题在这一点上是不一致的。

请注意，如果您使用的是Linux，则可能有rename命令，该命令将使用perl表达式并使用它为您重命名文件，如下所示：

rename 'my ($a,$b,$r);$_ = "$a - $b ($r)" 
  if ($a, $b, $r) = map { ucfirst $_ } /^_\d+-(.*?)--(.*?)_(.*?)_$/' _*

Answer 4

如果它们都遵循该格式，请尝试：

my ($author, $book, $revision) = $newfiles =~ /-(.*?)--(.*?)_(.*?)_/;

print ucfirst($author ) . " - $book ($revision)\n";

从Perl中的字符串中删除字符和数字

问题描述

4 个解决方案

解决方案1
1 2012-05-02 05:58:32

解决方案2
1 2012-05-02 06:05:31

解决方案3
1 已采纳 2012-05-02 06:18:58

解决方案4
0 2012-05-02 06:02:56

从Perl中的字符串中删除字符和数字

问题描述

4 个解决方案

解决方案1 1 2012-05-02 05:58:32

解决方案2 1 2012-05-02 06:05:31

解决方案3 1 已采纳 2012-05-02 06:18:58

解决方案4 0 2012-05-02 06:02:56

解决方案1
1 2012-05-02 05:58:32

解决方案2
1 2012-05-02 06:05:31

解决方案3
1 已采纳 2012-05-02 06:18:58

解决方案4
0 2012-05-02 06:02:56