PHP regex to strip a specific string only from URLs

Question

Can any regex ninjas out there come up with a PHP solution that strips the tags from any http URL but leaves the tags in the rest of the text?

For example:

the word <cite>printing</cite> is in http://www.thisis<cite>printing</cite>.com

should become:

the word <cite>printing</cite> is in http://www.thisisprinting.com

Answer 1

This is what I would do:

<?php
//a callback function wrapper for strip_tags
function strip($matches){
    return strip_tags($matches[0]);
}

//the string
$str = "the word <cite>printing</cite> is in http://www.thisis<cite>printing</cite>.com";
//match a url and call the strip callback on it
$str = preg_replace_callback("/:\/\/[^\s]*/", 'strip', $str);

//prove that it works
var_dump(htmlentities($str));

http://codepad.viper-7.com/XiPcs9
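
A slightly stricter variant of the same idea, as a rough sketch rather than a drop-in: the pattern here requires an explicit http:// or https:// scheme instead of a bare "://", and the callback is an anonymous function. The name stripTagsInUrls is made up for this example, and it assumes URLs appear in plain text and end at the next whitespace.

```php
<?php
// Hypothetical helper (not part of the answer above): removes all HTML tags
// from anything that looks like an http(s) URL and leaves other text alone.
function stripTagsInUrls(string $text): string
{
    // Assumption: a URL starts with http:// or https:// and runs up to
    // the next whitespace character.
    return preg_replace_callback(
        '~https?://\S+~i',
        function (array $m): string {
            // $m[0] is the whole matched URL, including any embedded tags.
            return strip_tags($m[0]);
        },
        $text
    );
}

$str = 'the word <cite>printing</cite> is in http://www.thisis<cite>printing</cite>.com';
echo stripTagsInUrls($str), PHP_EOL;
```

Note that strip_tags removes every tag inside the URL, not just cite, which may or may not be what you want.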

Answer 2

A regex suited to this replacement might be:

#(https?://)(.*?)<cite>(.*?)</cite>([^\s]*)#s

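The s flag lets . match across newlines.
Lazy matching is used between the tags so that the match does not run on past further tags of the same kind.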

Snippet:

<?php
$str = "the word <cite>printing</cite> is in http://www.thisis<cite>printing</cite>.com";
$replaced = preg_replace('#(https?://)(.*?)<cite>(.*?)</cite>([^\s]*)#s', '$1$2$3$4', $str);
echo $replaced;

// Output: the word <cite>printing</cite> is in http://www.thisisprinting.com
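
Two things to watch with a pattern like this: because . with the s flag also matches whitespace, the lazy .*? can run past the end of the URL and reach a cite tag in the following text, and a URL containing more than one cite pair only loses the first one per pass. A possible workaround, sketched under the assumption that a URL never contains whitespace, is to use \S instead of . and repeat until nothing changes. The function name stripCiteInUrls is hypothetical.

```php
<?php
// Hypothetical helper: removes every <cite>...</cite> pair found inside a URL.
function stripCiteInUrls(string $text): string
{
    // \S*? (instead of .*?) keeps the match inside one whitespace-delimited URL.
    $pattern = '#(https?://\S*?)<cite>(\S*?)</cite>#i';
    do {
        // Each pass removes at most one pair per URL;
        // $count receives the number of replacements made in this pass.
        $text = preg_replace($pattern, '$1$2', $text, -1, $count);
    } while ($count > 0);

    return $text;
}

echo stripCiteInUrls('see http://a<cite>b</cite>c<cite>d</cite>.com and <cite>word</cite>'), PHP_EOL;
```

The cite tag around "word" survives because it is separated from the URL by whitespace.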


Answer 3

Assuming you can identify the URLs in the text, you can do:

$str = 'http://www.thisis<cite>printing</cite>.com';
$str = preg_replace('~</?cite>~i', "", $str);
echo $str;

Output:

http://www.thisisprinting.com
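
If the URLs still have to be picked out of the surrounding text first, the replacement above could be applied per URL inside a callback. This is only a sketch: removeTagsFromUrls and its $tagNames parameter are invented for the example, a URL is assumed to run from http:// or https:// to the next whitespace, and only bare tags without attributes are handled.

```php
<?php
// Hypothetical helper: removes only the named tags, and only inside URLs.
function removeTagsFromUrls(string $text, array $tagNames): string
{
    // Build an alternation such as "cite|em", escaping each name for the regex.
    $names = implode('|', array_map(function ($name) {
        return preg_quote($name, '~');
    }, $tagNames));
    $tagPattern = '~</?(?:' . $names . ')>~i';

    return preg_replace_callback(
        '~https?://\S+~i', // assumption: URL = scheme up to the next whitespace
        function (array $m) use ($tagPattern): string {
            // Same replacement as above, limited to the matched URL.
            return preg_replace($tagPattern, '', $m[0]);
        },
        $text
    );
}

$str = 'the word <cite>printing</cite> is in http://www.thisis<cite>printing</cite>.com';
echo removeTagsFromUrls($str, ['cite']), PHP_EOL;
```

Unlike strip_tags, this leaves any other tags inside the URL untouched.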

3 answers

Answer 1  1  2013-10-24 21:56:18

Answer 2  1  2013-10-24 22:02:35

Answer 3  0  2013-10-24 21:48:46
