ANTLR behaviour with conflicting tokens

Published: 2021/11/11 4:08:15 · Tags: antlr, antlr4, lexer

Question

How is ANTLR lexer behavior defined in the case of conflicting tokens? Let me explain what I mean by "conflicting" tokens. For example, assume that the following is defined:

INT_STAGE       :   '1'..'6';
INT             :   '0'..'9'+;

There is a conflict here: after reading a sequence of digits, the lexer cannot know whether it has seen one INT or several INT_STAGE tokens (or some combination of both). In a quick test, it looks like the lexer prefers INT_STAGE when INT is defined after INT_STAGE, but does that mean INT might then not be matched? Otherwise, no INT_STAGE would ever be found.

Another example is:

FOOL            :   'fool';
FOO             :   'foo';
ID              :   ('a'..'z'|'A'..'Z'|'_'|'%') ('a'..'z'|'A'..'Z'|'0'..'9'|'_'|'%')*;

I was told that this is the "right" order to recognize all the tokens: while reading "fool" the lexer will find one FOOL token and not FOO ID or something else.
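For context, ANTLR 4 (one of the tags on this question) resolves this kind of overlap in two steps: the lexer first takes the rule that matches the longest stretch of input, and if several rules match the same longest stretch, the rule defined first in the grammar wins. The sketch below is only a toy model of that policy in Python, not ANTLR's actual implementation. The names `tokenize`, `INT_RULES` and `WORD_RULES` are hypothetical, the regexes are hand-translated from the two grammars above, and things like implicit literal tokens from parser rules and lexer modes are ignored.

```python
import re


def tokenize(rules, text):
    """Toy model: longest match wins, ties go to the earliest rule.

    rules is a list of (name, regex) pairs in grammar definition order.
    The patterns used here are simple enough that Python's greedy
    matching yields the longest match for each individual rule.
    """
    compiled = [(name, re.compile(pattern)) for name, pattern in rules]
    tokens = []
    pos = 0
    while pos < len(text):
        best_name, best_len = None, 0
        for name, regex in compiled:
            m = regex.match(text, pos)
            # Only a strictly longer match replaces the current best,
            # so on equal length the earlier rule is kept.
            if m and len(m.group()) > best_len:
                best_name, best_len = name, len(m.group())
        if best_name is None:
            raise ValueError(f"no rule matches at position {pos}")
        tokens.append((best_name, text[pos:pos + best_len]))
        pos += best_len
    return tokens


# Hand-translated from the first grammar (INT_STAGE defined before INT).
INT_RULES = [
    ("INT_STAGE", r"[1-6]"),
    ("INT", r"[0-9]+"),
]

# Hand-translated from the second grammar (FOOL, FOO, then ID).
WORD_RULES = [
    ("FOOL", r"fool"),
    ("FOO", r"foo"),
    ("ID", r"[a-zA-Z_%][a-zA-Z0-9_%]*"),
]

print(tokenize(INT_RULES, "3"))         # [('INT_STAGE', '3')]
print(tokenize(INT_RULES, "36"))        # [('INT', '36')]
print(tokenize(WORD_RULES, "fool"))     # [('FOOL', 'fool')]
print(tokenize(WORD_RULES, "foolish"))  # [('ID', 'foolish')]
```

Under this model, a single digit from 1 to 6 becomes INT_STAGE only because that rule is listed first, while any longer digit run becomes one INT rather than several INT_STAGE tokens. Likewise "fool" is a FOOL token because FOOL precedes ID, whereas "foolish" is a single ID because ID matches more input.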
