删除脚本中的 HTML 标签

Question

I've found this piece of code on the internet.我在互联网上找到了这段代码。 It takes a sentence and makes every single word into link with this word.它需要一个句子，并将每个单词与这个单词联系起来。 But it has weak side: if a sentence has HTML in it, this script doesn't remove it.但它有弱点：如果一个句子中有 HTML，这个脚本不会删除它。

For example: it replaces ' asserted ' with ' http://www.merriam-webster.com/dictionary/asserted '例如：它将“ asserted ”替换为“ http://www.merriam-webster.com/dictionary/asserted ”

Could you please tell me what to change in this code for it to change ' asserted ' to ' http://www.merriam-webster.com/dictionary/asserted '.您能否告诉我在此代码中进行哪些更改以将“ asserted ”更改为“ http://www.merriam-webster.com/dictionary/asserted ”。

var content = document.getElementById("sentence").innerHTML;

var punctuationless = content.replace(/[.,\/#!$%\؟^?&\*;:{}=\-_`~()”“"]/g, "");
var mixedCase = punctuationless.replace(/\s{2,}/g);
var finalString = mixedCase.toLowerCase();

var words = (finalString).split(" ");

var punctuatedWords = (content).split(" ");

var processed = "";
for (i = 0; i < words.length; i++) {
    processed += "<a href = \"http://www.merriam-webster.com/dictionary/" + words[i] + "\">";
    processed += punctuatedWords[i];
    processed += "</a> ";
}

document.getElementById("sentence").innerHTML = processed;

Answer 1

This regex /<{1}[^<>]{1,}>{1}/g should replace any text in a string that is between two of these <> and the brackets themselves with a white space.此正则表达式 /<{1}[^<>]{1,}>{1}/g 应该用空格替换其中两个 <> 和括号本身之间的字符串中的任何文本。 This这

 var str = "<hi>How are you<hi><table><tr>I<tr><table>love cake<g>" str = str.replace(/<{1}[^<>]{1,}>{1}/g," ") document.writeln(str);

will give back " How are you I love cake".会回馈“你好吗，我喜欢蛋糕”。

If you paste this如果你粘贴这个

var stripHTML = str.mixedCase(/<{1}[^<>]{1,}>{1}/g,"")

just below this就在这下面

var mixedCase = punctuationless.replace(/\s{2,}/g);

and replace mixedCase with stripHTML in the line after, it will probably work并在后面的行中用 stripHTML 替换混合大小写，它可能会起作用

Answer 2

function stripAllHtml(str) {
  if (!str || !str.length) return ''

  str = str.replace(/<script.*?>.*?<\/script>/igm, '')

  let tmp = document.createElement("DIV");
  tmp.innerHTML = str;

  return tmp.textContent || tmp.innerText || "";
}

stripAllHtml('<a>test</a>')

This function will strip all the HTML and return only text.此函数将删除所有 HTML 并仅返回文本。

Hopefully, this will work for you希望这对你有用

Answer 3

if you need to remove HTML tags And HTML Entities You can use如果您需要删除 HTML 标签和 HTML 实体，您可以使用

const text = '<p>test content </p><p><strong>test bold</strong>&nbsp;</p>'
text.replace(/<[^>]*(>|$)|&nbsp;|&zwnj;|&raquo;|&laquo;|&gt;/g, '');

the result will be "test content test bold"结果将是“测试内容测试粗体”

删除脚本中的 HTML 标签

问题描述

3 个解决方案

解决方案1
5 已采纳 2016-10-18 10:23:10

解决方案2
1 2016-10-18 09:37:08

解决方案3
0 2020-09-17 11:56:24

删除脚本中的 HTML 标签

问题描述

3 个解决方案

解决方案1 5 已采纳 2016-10-18 10:23:10

解决方案2 1 2016-10-18 09:37:08

解决方案3 0 2020-09-17 11:56:24

解决方案1
5 已采纳 2016-10-18 10:23:10

解决方案2
1 2016-10-18 09:37:08

解决方案3
0 2020-09-17 11:56:24