Towards a more flexible and accurate model for grammar levels and associativity #71

herbelin · 2023-07-22T17:23:26Z

Rendered: https://github.com/herbelin/ceps/blob/camlp5-associativity/text/camlp5-associativity.md

silene · 2023-07-24T05:55:40Z

text/camlp5-associativity.md

+- to renounce to Camlp5 support for levels, that is, to manage manually the inclusion of levels and the associativity of operators using distinct entries
+- to fix the problems of levels and associativity in Camlp5
+
+The first direction would a priori require more work and more support of Coq to manage itself the levels and associativity. On the other side, an advantage is that it would then probably be easier to experiment with different parsing engines.


So that it does not get lost, here is an LL(1) grammar I posted on Zulip that supports having multiple (non) associative operators at the same level, but causes a parsing error if the user tries to mix them in ambiguous way in a single expression, e.g., x opR y opL z. (L, N, R respectively stand for left-, non-, right-associative.)

term42: [ x = term41; y = term42_no_left -> y x | x = term41; y = term42_no_right -> y x | x = term41; "opN"; y = term41 -> OpN(x,y) | x = term41 -> x ] term42_no_left: [ "opR"; x' = term41; y = term42_no_left -> fun x => OpR (x, y x') ] term42_no_right: [ "opL"; x' = term41; y = term42_no_right -> fun x => y (opL (x, x')) | "opL'"; x' = term41; y = term42_no_right -> fun x => y (opL' (x, x')) ]

silene · 2023-07-24T06:13:37Z

text/camlp5-associativity.md

+The drawback is that we need to modify the printer so that it knows how to print such cycle made of a rule at some level (here 10) which is lower than the level on the right-hand side of the rule (here 200). This is however already made possible by PR #17875.
+
+For `-`, there are several solutions, such as:
+- to set it at level 10 with arguments at level 40 (the level of `*`) so that `2 ^ - x * y` is `2 ^ - (x * y)` but `2 ^ - x + y` is `(2 ^ - x) + y` (??)


Parsing 2 ^ - x * y as 2 ^ (- (x * y)) seems to be quite surprising. I would expect the consensus among mathematicians to be (2 ^ (- x)) * y.

In other words, there is no reason for the argument of prefix "-" to be at level 40. It should be at the current level (i.e., 35 if the argument of infix "^". In fact, the notion of levels for a notation that starts with a terminal and ends with a non-terminal does not make sense in the first place. It should not have an intrinsic level, it should always use the ambient level.

I got confused by the idea that - x * y was - (x * y) by default in mathematics, but that's the same as (- x) * y, so it does not have to be the former. And actually, most programming languages seem to make prefix - binding tightly (so morally at 10 for Coq).

For the example I mentioned, the level of x in - x do not matter. The whole notation could be at 10 with x at 10 too.

What do you mean by "It should not have an intrinsic level, it should always use the ambient level.". That is, in the following examples, how would you associate (or fail):

x * - y

x ^ - y

x + - y

f x - y

- f * x

- f ^ x

- f + x

- f x

Adding a link: https://en.wikipedia.org/wiki/Order_of_operations#Unary_minus_sign

And actually, most programming languages seem to make prefix - binding tightly (so morally at 10 for Coq).

That does not mean that it is a good idea. In fact, the very first sentence of the Wikipedia page you cite says that - 3 ^ 2 is interpreted as -9 in mathematics. And the only "languages" it mentions as binding tightly are Excel, PlanMaker, and bc.

So, having unary minus at level 10 would be awful. And having it at level 35 is already painful. I hate having to write - (x * y). It should have been at level 45.

x * - y

x ^ - y

x + - y

All three of them associate as x ! (- y).

f x - y

Do you have an infix minus in the context? If not, then parsing can be recovered as f x (- y).

But the interesting case is actually f x - y z, which would be parsed as f x (- y) z (assuming again that there is no infix minus, otherwise the point is moot).

- f * x

- f ^ x

- f + x

- f x

You did not say what the ambient level is, so it depends. Let us assume that those expressions all look like y + - f ! x (so, ambient level is 50 left associative). Then they would respectively parse as

y + (- (f * x))

y + (- (f ^ x))

(y + (- f)) + x

y + (- (f x))

(I'm tired, the first 3 questions were pointless. I meant the minus sign in the front)

I don't see what conclusion to draw otherwise. And we are somehow constrained by what is currently done, which is:

Require Import ZArith. Context (x y :Z) (f : Z -> Z) (g : Z). Set Printing Parentheses. Open Scope Z_scope. Check x * - f y. (* x * (- (f y)) *) (* due to recovery *) Check x ^ - f y. (* x ^ (- (f y)) *) (* due to recovery *) Check x + - f y. (* x + (- (f y) *) (* due to recovery *) Check f x - y. (* (f x) - y *) Check - g * x. (* (- g) * x *) Check - g ^ x. (* - (g ^ x) *) Check - g + x. (* (- g) ^ x *) Check - f x. (* - (f x) *) Check - x * y. (* (- x) * y *) Check - x ^ y. (* - (x ^ y) *) Check - x + y. (* (- x) + y *)

For the record, experimenting changing associativity of:

- x * y from (- x) * y to - (x * y) and

- x / y from (- x) / y to - (x / y)

requires several changes in stdlib, starting with:

Ring_theory.Ropp_mul_l : -(x * y) == -x * y (also in Ncring.v) and Ring_theory.ARopp_mul_l : -(x * y) == (-x) * y

BinInt.Zopp_mult_distr_l : - (n * m) = - n * m

Zdivfloor.div_opp_opp : b~=0 -> -a/-b == a/b

Field_theory.rdiv5 : - (a / b) == - a / b

Uint63.opp_spec : φ (- x) = - φ x mod wB

-a÷b in Zquote.v

- Qnum x * Zpos (Qden y) in QArith_base.v

non exhaustive list...

CEP on the treatment of levels and associativity in the parser

f3c5db2

silene reviewed Jul 24, 2023

View reviewed changes

herbelin mentioned this pull request Sep 6, 2023

Moving binder_constr at level 10 in anticipation of removing gramlib fallback coq/coq#18014

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Towards a more flexible and accurate model for grammar levels and associativity #71

Towards a more flexible and accurate model for grammar levels and associativity #71

herbelin commented Jul 22, 2023 •

edited

Loading

silene Jul 24, 2023

silene Jul 24, 2023

herbelin Jul 24, 2023 •

edited

Loading

silene Jul 24, 2023

herbelin Jul 24, 2023

herbelin Sep 6, 2023

Towards a more flexible and accurate model for grammar levels and associativity #71

Are you sure you want to change the base?

Towards a more flexible and accurate model for grammar levels and associativity #71

Conversation

herbelin commented Jul 22, 2023 • edited Loading

silene Jul 24, 2023

Choose a reason for hiding this comment

silene Jul 24, 2023

Choose a reason for hiding this comment

herbelin Jul 24, 2023 • edited Loading

Choose a reason for hiding this comment

silene Jul 24, 2023

Choose a reason for hiding this comment

herbelin Jul 24, 2023

Choose a reason for hiding this comment

herbelin Sep 6, 2023

Choose a reason for hiding this comment

herbelin commented Jul 22, 2023 •

edited

Loading

herbelin Jul 24, 2023 •

edited

Loading