preg_match获取文本

Question

I have test.php and on test1.php i have this php code running 我有test.php和test1.php我运行这个PHP代码

<?php 
$Text=file_get_contents("http://inviatapenet.gethost.ro/sop/test.php");
 preg_match_all('~fid="(.*?)"~si',$Text,$Match);
 $fid=$Match[1][1];
 echo $fid;
?>

what i want to do is to get the text from test.php 我想要做的是从test.php获取文本

from this fid='gty5etrf' JavaScript an i need just the content of fid 从这个fid ='gty5etrf'JavaScript我只需要fid的内容

<script type='text/javascript'>fid='gty5etrf'; v_width=620; v_height=490;</script><script type='text/javascript' src='http://www.reyhq.com/player.js'></script>

in the test1.php i need to show only the content 在test1.php中，我只需要显示内容

gty5etrf

what i have to do? 我该怎么办？

Answer 1

您可以尝试表达式fid\\=\\'([^\\']+)\\'因为[^\\']+使表达式以非正确的方式非贪婪，同样，表达式错误，因为它正在寻找双引号而不是单引号。

Answer 2

 preg_match_all('/fid=\'([^\']+)\'/',$Text,$Match);

Your regex was wrong. 你的正则表达式错了。 First, you were looking for fid="..." instead of fid='...' . 首先，你在寻找fid="..."而不是fid='...' 。 Second, with .* , the regex would match any character further than the end of the fid attribute. 其次，使用.* ，正则表达式将匹配除fid属性末尾之外的任何字符。

Here is the full code : 这是完整的代码：

preg_match_all('/fid=\'([^\']+)\'/',$Text,$Match);
$fid=$Match[1][0];
echo $fid;

Answer 3

And this should be 这应该是

$fid=$Match[1][0];

instead of : 代替：

$fid=$Match[1][1];

Answer 4

Matching string inside '' : '(?:[^\\\\']*|\\\\.)*' 在'' ： '(?:[^\\\\']*|\\\\.)*'匹配字符串'(?:[^\\\\']*|\\\\.)*'

Matching string inside "" : "(?:[^\\\\"]*|\\\\.)*" 在""匹配字符串： "(?:[^\\\\"]*|\\\\.)*"

Both of them (ignoring spaces): fid\\s*=\\s*('(?:[^\\\\']*|\\\\.)*'|"(?:[^\\\\"]*|\\\\.)*") 两者（忽略空格）： fid\\s*=\\s*('(?:[^\\\\']*|\\\\.)*'|"(?:[^\\\\"]*|\\\\.)*")

And escaped for php: 并为php转义：

$regexp = '~fid\\s*=\\s*(\'(?:[^\\\\\']*|\\\\.)*\'|"(?:[^\\\\"]*|\\\\.)*")~';

This will handle correctly even this: 这将正确处理，即使这样：

fid  = 'foo\'s bar';

Answer 5

a short pattern: 一个简短的模式：

$pattern = '~\bfid\s*=\s*["\']\K\w+~';

or a long pattern: 或长模式：

$pattern = '~<script[^>]*>(?:[^f<]+|\Bf+|f(?!id\b)|<+(?!/script>))*+\bfid\s*=\s*(["\'])\K[^"\']+(?=\1)~';

the result with 结果

preg_match($pattern, $Text, $match);
$fid = $match[0];

The short pattern finds sequences like: 短模式找到如下序列：

fid='somechars
fid  = "somecchars

The long pattern does the same but also checks you are between script tags. 长模式执行相同但也检查脚本标记之间。

Using XPath: 使用XPath：

$html = <<<'EOD'
<script type='text/javascript'>fid='gty5etrf'; v_width=620; v_height=490;</script><script type='text/javascript' src='http://www.reyhq.com/player.js'></script>
EOD;

$dom = new DOMDocument;
libxml_use_internal_errors(true);
$dom->loadHTML($html);
$xp = new DOMXPath($dom);
$query = <<<'EOD'
    substring-before(
        substring-after(
            //script[contains(., "fid='")],
            "fid='"
        ),
        "'"
    )
EOD;

echo $xp->evaluate($query);

preg_match获取文本

问题描述

5 个解决方案

解决方案1
2 2013-04-16 09:32:38

解决方案2
2 已采纳 2013-04-16 09:33:09

解决方案3
0 2013-04-16 09:35:33

解决方案4
0 2013-04-16 09:37:50

解决方案5
0 2013-04-16 10:31:28

preg_match获取文本

问题描述

5 个解决方案

解决方案1 2 2013-04-16 09:32:38

解决方案2 2 已采纳 2013-04-16 09:33:09

解决方案3 0 2013-04-16 09:35:33

解决方案4 0 2013-04-16 09:37:50

解决方案5 0 2013-04-16 10:31:28

解决方案1
2 2013-04-16 09:32:38

解决方案2
2 已采纳 2013-04-16 09:33:09

解决方案3
0 2013-04-16 09:35:33

解决方案4
0 2013-04-16 09:37:50

解决方案5
0 2013-04-16 10:31:28