Pandoc过滤器pandoc中的字符escaping.Para Lua function

Question

I'm using Pandoc for a large HTML to Markdown conversion project and am trying to write lua filters to handle some of the special cases.我正在将 Pandoc 用于大型 HTML 到 Markdown 转换项目，并且正在尝试编写 lua 过滤器来处理一些特殊情况。

The most common case I am trying to handle is converting specially formatted information boxes into the pymarkdown summary/detail formatting .我试图处理的最常见的情况是将特殊格式的信息框转换为 pymarkdown 摘要/详细信息格式。

Source HTML来源 HTML

<div class="special-info-block">
  <p class="title">INFO</p>
</div>

Goal Markdown目标 Markdown

???+ info "INFO"

I can use this function to replace the "INFO":我可以使用这个 function 来替换“INFO”：

function Div(el)
   if el.classes[2] == "special-info-block" and pandoc.utils.stringify(el.content[1]) == "INFO" then
      el.content[1] = pandoc.Para('??? info "INFO"+')
      return el
   end
end

but the resulting markdown escapes the quotation marks around INFO:但生成的 markdown 转义了 INFO 周围的引号：

???+ info \"INFO\"

How do I insert the literal string instead?如何改为插入文字字符串？ Is this a feature of the pandoc.Para constructor or should I be looking elsewhere?这是 pandoc.Para 构造函数的一个特性还是我应该在别处寻找？

Answer 1

The escaping happens during Markdown generation, so there are two options here: escaping 发生在 Markdown 生成期间，因此这里有两个选项：

Call pandoc with -t markdown-smart , which will instruct the Markdown writer to treat quotes as normal chars;使用-t markdown-smart调用 pandoc，这将指示 Markdown 编写器将引号视为普通字符；
Create a raw Markdown block instead of a Para to get maximal control over the output: el.content[1] = pandoc.RawBlock('markdown', '??? info "INFO"+') .创建原始 Markdown 块而不是 Para 以获得对 output 的最大控制： el.content[1] = pandoc.RawBlock('markdown', '??? info "INFO"+') 。

Both methods these should give the desired result, but the second is probably preferable.这两种方法都应该给出预期的结果，但第二种方法可能更可取。

Pandoc过滤器pandoc中的字符escaping.Para Lua function

问题描述

1 个解决方案

解决方案1
0 已采纳 2020-12-17 15:05:57

Pandoc过滤器pandoc中的字符escaping.Para Lua function

问题描述

1 个解决方案

解决方案1 0 已采纳 2020-12-17 15:05:57

解决方案1
0 已采纳 2020-12-17 15:05:57