[英]How to remove empty strings in custom arangosearch analyzer
I've got a custom analyzer that is like text_en but doesn't include the hyphen as a delimiter:我有一个自定义分析器,类似于 text_en 但不包含连字符作为分隔符:
{pipeline:[
{type:"norm",properties:{
locale: "en.utf-8", accent: false, case: "lower", stemming: false}},
{type:"delimiter",properties:{delimiter:" "}},
{type:"delimiter",properties:{delimiter:"!"}},
{type:"delimiter",properties:{delimiter:"."}},
{type:"delimiter",properties:{delimiter:","}},
{type:"delimiter",properties:{delimiter:";"}},
{type:"delimiter",properties:{delimiter:"?"}},
{type:"delimiter",properties:{delimiter:"["}},
{type:"delimiter",properties:{delimiter:"]"}},
{type:"delimiter",properties:{delimiter:"{"}},
{type:"delimiter",properties:{delimiter:"}"}},
{type:"delimiter",properties:{delimiter:"("}},
{type:"delimiter",properties:{delimiter:")"}},
{type:"delimiter",properties:{delimiter:"<"}},
{type:"delimiter",properties:{delimiter:">"}},
{type:"delimiter",properties:{delimiter:"~"}},
{type:"delimiter",properties:{delimiter:"@"}},
{type:"delimiter",properties:{delimiter:"="}},
{type:"delimiter",properties:{delimiter:"&"}},
{type:"delimiter",properties:{delimiter:"|"}},
{type:"delimiter",properties:{delimiter:"\n"}},
{type:"stem",properties:{locale:"en.utf-8"}}]}'
The issue is that chaining like this will return empty strings.问题是这样的链接将返回空字符串。 The tokens for the string "HYPNOS2, Aphrodite and other Microcontrollers."
字符串“HYPNOS2、Aphrodite 和其他微控制器”的标记。 is this:
这是:
[
"hypnos2",
"",
"aphrodit",
"and",
"other",
"microcontrol",
""
]
What do I do to remove the empty string tokens in this analyzer?我该怎么做才能删除此分析器中的空字符串标记?
I added another stage in the pipeline after the delimiters.我在分隔符之后在管道中添加了另一个阶段。 It's an aql with keepNull=false and queryString of return @param==""?null:@param.
这是一个带有keepNull=false 和返回@param==""?null:@param 的queryString 的aql。 If someone has a simpler suggestion I'm interested.
如果有人有更简单的建议,我很感兴趣。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.