Regex to extract link tags from an HTML string on the server side

Question

I have an HTML page source that is held as a string on the server side.

I need to extract the link tags from the string and push each extracted string to an array. There can be multiple links with the same starting tag.

The <link rel="icons"................ > tag can contain anything inside it. I have specified the startTag and endTag in the code below.

  var startTag = '<link rel="icons"';
  var endTag = '>';
  const re = new RegExp('(' + startTag + ')(.|\n)+?(' + endTag + ')', 'g');

However, when I log the value of re to the console, it is not what I expect.

Desired output

['<link rel="icons" href="icons1.png"', '<link rel="icons" href="icons2.png"', '<link rel="icons" href="icons3.png"']

Thanks in advance.

Answer 1

I think you're looking for something like this (the replace is just to remove extra whitespace):

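  const data = `
    <link rel="icons" href="icons1.png" >
    <link rel="icons" href="icons2.png" >
    <link rel="icons" href="icons3.png" >
  `;

  // The s (dotAll) flag lets . match newlines, so a tag may span several lines.
  const links = data.match(/<link.*?>/gs)
    // Collapse each run of whitespace inside a matched tag to a single space.
    .map(link => link.replace(/\s+/g, ' '));

  console.log(links);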

If you're in an environment that doesn't support the s flag, you could use /<link[^]*?>/g instead.
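
To tie this back to the startTag from the question, here is a minimal sketch that needs no s flag at all. It assumes that only tags beginning with the exact startTag text are wanted and that no attribute value contains a literal >. The helper names escapeRegExp and extractTags, and the sample html string, are made up for illustration and are not part of the original answer.

```javascript
// Hypothetical helper: escape regex metacharacters so that startTag
// is matched literally when it is spliced into a RegExp.
function escapeRegExp(text) {
  return text.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

// Hypothetical helper: collect every tag that begins with startTag.
// [^>]* matches any character except '>', newlines included, so the
// s (dotAll) flag is not required.
function extractTags(html, startTag) {
  const re = new RegExp(escapeRegExp(startTag) + '[^>]*>', 'g');
  // match() returns null when nothing matches, so fall back to [].
  const matches = html.match(re) || [];
  // Collapse runs of whitespace (including newlines) to one space.
  return matches.map(tag => tag.replace(/\s+/g, ' '));
}

// Sample input, assumed for illustration only.
const html = `
<link rel="icons"
      href="icons1.png" >
<link rel="stylesheet" href="style.css">
<link rel="icons" href="icons2.png" >
`;

const icons = extractTags(html, '<link rel="icons"');
console.log(icons);
```

Because the pattern starts from the escaped startTag, the stylesheet link in the sample is skipped. A regex like this is not a full HTML parser, so markup with > inside an attribute value, or with the attributes in a different order, would need a real parser instead.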

Answer 1 (2019-03-07 15:00:38)
