ANTLR词法分析器规则消耗过多 [英] ANTLR lexer rule consumes too much

查看：102 发布时间：2020/9/2 23:54:41 antlr grammar antlr3

本文介绍了ANTLR词法分析器规则消耗过多的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

ANTLR Lexer规则设计

ANTLR Lexer Rule Design

我需要以下令牌:

允许的字符包括大写，小写，数字，空格和连字符
不固定长度(必须至少两个字符)
令牌必须至少包含一个空格或连字符
令牌必须以大写，小写，数字，空格或连字符开头和结尾(不能以空格开头或结尾)

以下语法中的ANTLR词法分析器规则"AlphaNumericSpaceHyphen"几乎有效，除了一种情况.使用解析器规则"sic"进行测试，以下输入将被解析(不带引号):

The ANTLR lexer rule "AlphaNumericSpaceHyphen" in the grammar below almost works except for one case. Using the parser rule "sic" to test, the following input will parse (without quotes):

标准工业分类:水运输[4400]"

"STANDARD INDUSTRIAL CLASSIFICATION: WATER TRANSPORTATION[4400]"

以下输入无法解析(不带引号):

The following input fails to parse (without quotes):

标准工业分类:水运输[4400]"

"STANDARD INDUSTRIAL CLASSIFICATION: WATER TRANSPORTATION [4400]"

问题是词法分析器规则"AlphaNumericSpaceHyphen"在词法分析器意识到没有匹配项是因为它走得太远之前，在"WATER TRANSPORTATION"之后占用了空间和左方括号.

The issue being that the lexer rule "AlphaNumericSpaceHyphen" consumes the space and the left square bracket after "WATER TRANSPORTATION" before the lexer realizes that there is no match because it went too far.

我已经尝试过各种类型的谓词，并且在没有任何运气的情况下向前看.任何帮助将不胜感激.

I have experimented with various type of predicates and look aheads without any luck. Any help is greatly appreciated.

grammar T;

sic: SICSpecifier AlphaNumericSpaceHyphen  LEFTBRACKET Digits RIGHTBRACKET;

LEFTBRACKET  
:   '[';  

RIGHTBRACKET 
:   ']';

SICSpecifier: 'STANDARD INDUSTRIAL CLASSIFICATION:';

WS : (' '|'\t')+ 
{   
  $channel = HIDDEN;  
};  

fragment UCASEALPHA : 'A'..'Z';
fragment LCASEALPHA : 'a'..'z';
fragment DIGIT : '0'..'9';
Digits: DIGIT+;

AlphaNumericSpaceHyphen 
:           (UCASEALPHA|LCASEALPHA |DIGIT|'-')+  (' ' (UCASEALPHA|LCASEALPHA |DIGIT|'-')+)+   
        |   (UCASEALPHA|LCASEALPHA |DIGIT)+ ('-')+  ((' '|UCASEALPHA|LCASEALPHA |DIGIT|'-')* (UCASEALPHA|LCASEALPHA |DIGIT|'-'))?
        |   ('-')+ (UCASEALPHA|LCASEALPHA |DIGIT)+  ((UCASEALPHA|LCASEALPHA |DIGIT|'-'|' ')* (UCASEALPHA|LCASEALPHA |DIGIT|'-'))?   
        ;

ANTLR词法分析器规则消耗过多 [英] ANTLR lexer rule consumes too much

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

ANTLR词法分析器规则消耗过多 [英] ANTLR lexer rule consumes too much

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭