Regular expression to remove trailing characters

Question

I'm looking for a regular expression in PHP that could transform incoming strings like this:

abaisser_negation_pronominal_question     => abaisser_n_p_q
abaisser_pronominal_question              => abaisser_p_q
abaisser_negation_question                => abaisser_n_q
abaisser_negation_pronominal              => abaisser_n_p
abaisser_negation_voix_passive_pronominal => abaisser_n_v_p_p
abaisser                                  => abaisser

With PHP code close to something like:

$line=preg_replace("/<h3>/im", "", $line);

How would you do it?

Answer 1

You can use:

$input = preg_replace('/(_[A-Za-z])[^_\n]*/', '$1', $input);


Explanation:

This regex searches for (_[A-Za-z])[^_\n]* which means an underscore followed by a single letter, then any run of characters up to (but not including) the next underscore or newline
It captures the first part (_[A-Za-z]) in group 1, referenced as $1
The replacement is $1, which keeps only the underscore and the first letter of each segment
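
To check the pattern against the examples from the question, a minimal sketch could look like the following. The function name abbreviateSuffixes is hypothetical and only used for illustration; the pattern and replacement are the ones from this answer.

```php
<?php
// Hypothetical helper (not from the original answer) wrapping the pattern above.
function abbreviateSuffixes(string $input): string
{
    // Keep "_" plus the first letter of each segment, drop the rest of the segment.
    return preg_replace('/(_[A-Za-z])[^_\n]*/', '$1', $input);
}

// Input => expected output, copied from the question.
$cases = [
    'abaisser_negation_pronominal_question'     => 'abaisser_n_p_q',
    'abaisser_pronominal_question'              => 'abaisser_p_q',
    'abaisser_negation_question'                => 'abaisser_n_q',
    'abaisser_negation_pronominal'              => 'abaisser_n_p',
    'abaisser_negation_voix_passive_pronominal' => 'abaisser_n_v_p_p',
    'abaisser'                                  => 'abaisser',
];

foreach ($cases as $input => $expected) {
    $actual = abbreviateSuffixes($input);
    // Flag any case where the result differs from the question's table.
    $status = ($actual === $expected) ? 'ok' : 'MISMATCH';
    echo str_pad($input, 42), ' => ', $actual, ' [', $status, ']', PHP_EOL;
}
```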

Answer 2

$line = preg_replace("/_([a-z])([a-z]*)/i", "_$1", $line);

Answer 3

You can use this regex:

$input = preg_replace('/_(.)[^\n_]+/', '_$1', $input);


It captures the character after _ and then matches until a \n or _ is encountered. The whole match is replaced with _$1, that is, _ plus the captured character.
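
Because \n is excluded from the character class, the same call should also work on a string holding several entries, one per line. A small sketch, assuming the input is newline-separated (the helper name shortenSegments is made up for this example):

```php
<?php
// Hypothetical helper wrapping the pattern from this answer.
function shortenSegments(string $text): string
{
    // "_" + one captured character, then everything up to the next "_" or newline.
    return preg_replace('/_(.)[^\n_]+/', '_$1', $text);
}

// Three entries separated by newlines; a match never runs past the end of its line.
$lines = "abaisser_negation_question\nabaisser_pronominal\nabaisser";

echo shortenSegments($lines), PHP_EOL;
```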

Answer 4

You could use \K or a positive lookbehind.

$input = preg_replace('~_.\K[^_\n]*~', '', $input);

The pattern _. in the above regex matches an _ and the character following it. \K discards the previously matched characters (the _ plus the following character) from the reported match, so they are not replaced. [^_\n]* then matches zero or more characters that are neither _ nor a newline. As a result, everything after the first character of each segment is matched, up to the next _ or \n. Removing those characters gives you the desired output.
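
As a rough illustration of what \K changes, the sketch below compares it with an equivalent capture-group version: with \K the replacement is an empty string, while the capture-group form has to put the kept prefix back with $1. Both function names are hypothetical.

```php
<?php
// Hypothetical helper: \K drops "_x" from the reported match, so only the tail is removed.
function stripWithResetK(string $text): string
{
    return preg_replace('~_.\K[^_\n]*~', '', $text);
}

// Hypothetical helper: same idea with a capture group that is re-inserted via $1.
function stripWithCapture(string $text): string
{
    return preg_replace('~(_.)[^_\n]*~', '$1', $text);
}

$sample = 'abaisser_negation_voix_passive_pronominal';

echo stripWithResetK($sample), PHP_EOL;
// Both variants should agree on this input.
var_dump(stripWithResetK($sample) === stripWithCapture($sample));
```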


$input = preg_replace('~(?<=_.)[^_\n]*~', '', $input);

It looks behind for the _ and the character following the _, then matches all the characters up to the next underscore or newline character.
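
preg_replace also accepts an array as its subject and returns an array, so the lookbehind pattern can be applied to a whole list in one call. A minimal sketch, with abbreviateAll as a made-up name:

```php
<?php
// Hypothetical helper: applies the lookbehind pattern to every element of an array.
function abbreviateAll(array $inputs): array
{
    // With an array subject, preg_replace returns an array with the same keys.
    return preg_replace('~(?<=_.)[^_\n]*~', '', $inputs);
}

$result = abbreviateAll([
    'abaisser_negation_pronominal',
    'abaisser_pronominal_question',
    'abaisser',
]);

print_r($result);
```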


4 answers

Answer 1: score 2, posted 2014-12-20 15:48:23
Answer 2: score 0, posted 2014-12-20 15:47:10
Answer 3: score 0, posted 2014-12-20 15:52:25
Answer 4: score 0, accepted, posted 2014-12-20 16:27:23
