Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow unicode in all elements #960

Closed
kesara opened this issue Jan 18, 2023 · 9 comments · Fixed by #1017
Closed

Allow unicode in all elements #960

kesara opened this issue Jan 18, 2023 · 9 comments · Fixed by #1017
Labels
enhancement New feature or request

Comments

@kesara
Copy link
Member

kesara commented Jan 18, 2023

Please add, both here and below: dd, dt, li, blockquote, and any other block-level elements I missed. Then add, both here and below, the "inline" elements: cref (?), em, eref (?), iref (?), relref, strong, sub, sup, tt, and xref. (I didn't think much about the cross reference stuff, but I think they can contain text.)

I have immediate use for many of these. I can wrap in <t> for some of the block-level elements, but not the inline-level ones.

Originally posted by @martinthomson in #895 (comment)

@kesara kesara added the enhancement New feature or request label Jan 18, 2023
@cabo
Copy link
Contributor

cabo commented Jan 18, 2023

(Not just block elements.)

Note that the need to contort the syntax (insert <t>) to work around what is essentially an xml2rfc bug is unacceptable.

(But it is good that we are at the halfway house with this :-)

@rjsparks
Copy link
Member

Lets describe the scope better.
Do we need an explicit enumeration of elements to review, or are there groups that describe the set well enough?

@cabo
Copy link
Contributor

cabo commented Jan 19, 2023

[...] are there groups that describe the set well enough?

Yes: all elements.

There is no element in the RFCXMLv3 grammar that has the requirement to use <u.

@jrlevine
Copy link

@rjsparks Funny you should mention that. One of the unfinished tasks in 7991bis (or whatever we call it) is to clean up the set of elements so that elements with similar semantics allow the same kinds of contents.

@alicerusso
Copy link

alicerusso commented May 25, 2023

Here's an example of a document in queue where it would be useful to not have the current restriction. See Section 4.1.1 of draft-ietf-tcpm-rfc8312bis

If we put the list into <dl> -- In xml2rfc currently, the non-ASCII chars are allowed to be used inside <t> (without <u>). However, the same is not true for <dt> -- and <dt> cannot contain <t>.
So, if you put the desired char (β) in <dt>, xml2rfc outputs &#946; without warning you that it has given you bad output. (For background <contact> was used as a workaround in the original XML.)

Considering ways forward:

  • even though it's really a definition list, switch to use <ul> and put the term in <li>, then it will allow the char. Because <li> can contain <t>. for example:
      <ul>
        <li><t>β_<em><sub>cubic</sub></em>: CUBIC multiplicative decrease factor</t></li>
...
      </ul>
  • or wait for a change to allow usage of non-ASCII chars (without <u>) inside of <dt>.

@cabo
Copy link
Contributor

cabo commented May 25, 2023

There is no reason to confine the fix to just <t elements.
This initial step was just what we did do achieve some forward progress.
Instead, the artificial restrictions invented by xml2rfc need to removed altogether.

@rjsparks
Copy link
Member

Who's taking the pen to push the grammar regularization through rswg?

@jrlevine
Copy link

@rjsparks
I should but it's not going to happen until at least late June

@cabo
Copy link
Contributor

cabo commented May 30, 2023

Who's taking the pen to push the grammar regularization through rswg?

I'm not sure why this question is under this issue, as removing the misguided character set restrictions does not require any changes in the XML grammar.

@kesara kesara changed the title Allow unicode in all block-level elements Allow unicode in all elements Jul 10, 2023
kesara added a commit to kesara/xml2rfc that referenced this issue Jul 11, 2023
kesara added a commit to kesara/xml2rfc that referenced this issue Jul 27, 2023
kesara added a commit that referenced this issue Jul 31, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants