PHP-从字符串中删除不以特定单词开头的单词（http | https | www | .com | .net）

Question

I have a string with some text and some URLs in it. 我有一个带有一些文本和一些URL的字符串。 My goal is to remove the following from the string: 我的目标是从字符串中删除以下内容：

$removeThis = array('http://', 'https://', 'www.', '.com', '.net'); $ removeThis = array（'http：//'，'https：//'，'www。'，'.com'，'.net'）;

BUT ONLY IF the word to be removed doesn't start with: http://good.com , http://www.good.com , https://good.com , or https://www.good.com . 但是仅当要删除的单词不是以以下开头时： http : //good.com，http : //www.good.com，https : //good.com或https://www.good.com 。

In other words, I want to remove http|s|www.|.com|.net parts from the string (but only if they don't belong to good.com domain). 换句话说，我想从字符串中删除http | s | www。| .com | .net部分（但仅当它们不属于good.com域时）。

INPUT: INPUT：

$string='Hello world, this is spamming: www.spam.com, spam.net, https://spam.com, https://spam.com/tester. And this is not spam so do not touch it: http://www.good.com/okay, http://good.com, and also https://good.com/well';

RESULT SHOULD BE: 结果应为：

Hello world, this is spamming: spam, spam, spam, spam/tester. And this is not spam so do not touch it: http://www.good.com/okay, http://good.com, and also https://good.com/well

I think preg_replace is needed here.. 我认为在这里需要preg_replace。

Answer 1

try below: 请尝试以下方法：

  $preg = '/(?:(http|https):\/\/)?(?:www\.)?\w+\.(com|net)/i';

$str = preg_replace_callback($preg, function($matches) {
    $removeThis = array('/http:\/\//i', 'https://', 'www.', '.com', '.net');
    if (preg_match('/(http|https):\/\/(www\.)?good\.(com|net)/i', $matches[0])) return $matches[0];
    return preg_replace('/((http|https):\/\/|www\.|\.com|\.net)/i', '', $matches[0]);
}, $string);

Answer 2

This might help you: 这可能对您有帮助：

$url = "www.good.net/tooooo.php";
$regex = array('/(https?:..)/','/^www\./','/(\.com.|\.net.|\.co.)+([^\s]+)/');
$url = preg_replace($regex, '', $url);
echo $url;

Answer 3

You should use REGEX which are really powerful, here the step to do it pretty easily : 您应该使用真正强大的REGEX，在这里轻松完成此步骤：

Match all urls using preg_replace_callback 使用preg_replace_callback匹配所有网址
In callback function, detect if it belongs to the whitelisted domain or not (preg_match or strrpos) 在回调函数中，检测它是否属于白名单域（preg_match或strrpos）
Still in callback function : Treat the string in consequence and return it 仍在回调函数中：处理结果字符串并返回

Regex for urls : 网址的正则表达式：

#^(https?|ftp):\/\/(-\.)?([^\s\/?\.#]+\.?)+(\/[^\s]*)?$#

PHP-从字符串中删除不以特定单词开头的单词（http | https | www | .com | .net）

问题描述

3 个解决方案

解决方案1
1 已采纳 2015-04-27 02:30:37

解决方案2
0 2015-04-27 02:01:50

解决方案3
0 2015-04-27 02:21:00

PHP-从字符串中删除不以特定单词开头的单词（http | https | www | .com | .net）

问题描述

3 个解决方案

解决方案1 1 已采纳 2015-04-27 02:30:37

解决方案2 0 2015-04-27 02:01:50

解决方案3 0 2015-04-27 02:21:00

解决方案1
1 已采纳 2015-04-27 02:30:37

解决方案2
0 2015-04-27 02:01:50

解决方案3
0 2015-04-27 02:21:00