ANTLR grammar not matching the expected lexer rule

Question

I'm trying to match a duration string, like for 30 minutes or for 2 hours, using the following rules:

durationPhrase: FOR_STR (MINUTE_DURATION | HOUR_DURATION);

MINUTE_DURATION: NONZERO_NUMBER MINUTE_STR;

HOUR_DURATION: NONZERO_NUMBER HOUR_STR;

MINUTE_STR: 'minute'('s')?;

HOUR_STR: 'hour'('s')?;

FOR_STR: 'for';

NONZERO_NUMBER: [0-9]+;

WS: (' '|[\n\t\r]) -> skip;

With the following input:

for 30 minutes

When I try to debug/match the durationPhrase rule, I get this error:

line 1:4 mismatched input '30' expecting {MINUTE_DURATION, HOUR_DURATION}

But I can't seem to figure out which lexer rule the '30' is matching. I was under the impression that the "longest" lexer rule would win, which would be the MINUTE_DURATION rule.

Is it instead matching NONZERO_NUMBER first? And if so, why?

Answer 1

It's matching NONZERO_NUMBER because neither of the other patterns applies. The longest match only wins among the rules that can actually match at the current position, and at '30' followed by a space, NONZERO_NUMBER is the only one that can. If you had entered 30minutes, it would have matched MINUTE_DURATION, but as a token pattern, MINUTE_DURATION won't match the space character.

You ignore whitespace by applying -> skip to the token WS. That can only happen after WS is recognised as a token, i.e. after tokenisation. During tokenisation, whitespace characters are just characters.

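If you make MINUTE_DURATION and HOUR_DURATION syntax (parser) rules rather than lexical rules, it should work as expected. In ANTLR that means giving them names that start with a lowercase letter, for example minuteDuration and hourDuration, since names starting with an uppercase letter are lexer rules.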
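
As a rough sketch of that change, the grammar from the question could be rearranged as below. The grammar name Duration and the rule names minuteDuration and hourDuration are my own placeholders, not something from the question, and I have not run this against the asker's setup. The whitespace rule is also written as a single character set, which should behave the same way for this input.

```antlr
grammar Duration;   // hypothetical name; use your own grammar's name

durationPhrase: FOR_STR (minuteDuration | hourDuration);

// Parser rules (lowercase names): they are matched against the token
// stream, where the WS tokens have already been skipped by the lexer.
minuteDuration: NONZERO_NUMBER MINUTE_STR;
hourDuration:   NONZERO_NUMBER HOUR_STR;

// Lexer rules (uppercase names): each one matches a run of characters,
// so none of them has to deal with the spaces in between.
MINUTE_STR: 'minute' 's'?;
HOUR_STR: 'hour' 's'?;
FOR_STR: 'for';
NONZERO_NUMBER: [0-9]+;   // kept as in the question; note it also accepts 0
WS: [ \t\r\n]+ -> skip;
```

With this arrangement, for 30 minutes should tokenise as FOR_STR, NONZERO_NUMBER, MINUTE_STR, and the parser then picks minuteDuration. An input without the space, such as for 30minutes, should produce the same three tokens, because no single lexer rule spans both the number and the unit any more.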

Answer 1: score 4, accepted, 2018-06-25 13:43:34
